howtogeek.com
robots.txt

Robots Exclusion Standard data for howtogeek.com

Resource Scan

Scan Details

Site Domain howtogeek.com
Base Domain howtogeek.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-09-25T07:27:52+00:00
Next Scan 2024-12-24T07:27:52+00:00

Last Successful Scan

Scanned 2023-12-01T06:17:48+00:00
URL https://howtogeek.com/robots.txt
Redirect https://www.howtogeek.com/robots.txt
Redirect Domain www.howtogeek.com
Redirect Base howtogeek.com
Domain IPs 3.222.102.97
Redirect IPs 3.222.102.97
Response IP 3.222.102.97
Found Yes
Hash 40ee60d3725a51a5daa063cfdbc277586f315e8eb2018e9459cd344efd062ecd
SimHash 69155d40c113

Groups

*

Rule Path
Disallow /pixel.png*
Disallow /search/
Disallow /profile/

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.howtogeek.com/sitemap.xml
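The groups and records listed above can be checked programmatically. The following is a minimal sketch, assuming the original file looked roughly like the ROBOTS_TXT string below; that text is reconstructed from this report, so its directive capitalization and ordering are assumptions rather than the file's actual contents. It uses Python's standard urllib.robotparser, and the agent name "SomeBot" is hypothetical. Note that this parser matches paths by prefix and does not expand the trailing * in /pixel.png*, so that rule is not exercised here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report above (assumed layout, not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /pixel.png*
Disallow: /search/
Disallow: /profile/

User-agent: gptbot
Disallow: /

Sitemap: https://www.howtogeek.com/sitemap.xml
"""

BASE = "https://www.howtogeek.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "SomeBot" is a hypothetical crawler that falls under the * group.
    for agent, path in [
        ("SomeBot", "/search/query"),
        ("SomeBot", "/profile/someone"),
        ("SomeBot", "/some-article/"),
        ("gptbot", "/some-article/"),
    ]:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

Under these assumptions, a crawler in the * group is blocked only from /search/ and /profile/ (plus the pixel rule in parsers that support wildcards), while gptbot is blocked from the whole site.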