danigilbert.com
robots.txt

Robots Exclusion Standard data for danigilbert.com

Resource Scan

Scan Details

Site Domain danigilbert.com
Base Domain danigilbert.com
Scan Status Ok
Last Scan2025-10-07T19:10:47+00:00
Next Scan 2025-11-06T19:10:47+00:00

Last Scan

Scanned2025-10-07T19:10:47+00:00
URL https://danigilbert.com/robots.txt
Redirect https://www.danigilbert.com/robots.txt
Redirect Domain www.danigilbert.com
Redirect Base danigilbert.com
Domain IPs 199.34.228.48
Redirect IPs 199.34.228.48
Response IP 199.34.228.48
Found Yes
Hash d888c0e0d56a9eed2e2922c751063ac4915a793ed4467551c98ed71cdc8285b9
SimHash 2254dc466f93

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /research.html
Disallow /teaching.html

Other Records

Field Value
sitemap https://www.danigilbert.com/sitemap.xml