database.earth
robots.txt

Robots Exclusion Standard data for database.earth

Resource Scan

Scan Details

Site Domain database.earth
Base Domain database.earth
Scan Status Ok
Last Scan 2024-10-28T05:29:20+00:00
Next Scan 2024-11-04T05:29:20+00:00

Last Scan

Scanned 2024-10-28T05:29:20+00:00
URL https://database.earth/robots.txt
Domain IPs 138.199.46.68, 2400:52e0:1500::868:1
Response IP 138.199.46.68
Found Yes
Hash 125587494420990b84a55eba3d4e330140d4b90a4f8e4efd4ef2f9d5ec8f28b1
SimHash 891c9860e190
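
The Hash value above is 64 hexadecimal characters, which matches the length of a SHA-256 digest. The report does not state which algorithm or which bytes were hashed, so the following is only a minimal sketch, assuming SHA-256 over the raw response body. The function name `body_fingerprint` is hypothetical.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes of robots.txt with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Example with a placeholder body (not the real file contents).
digest = body_fingerprint(b"User-agent: *\nDisallow: /test\n")
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

If the assumption holds, an exact hash like this would change whenever any byte of the file changes.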

Groups

semrushbot
ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

imagesiftbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3600

amazonbot
bytespider
gptbot
mj12bot
dataforseobot
claudebot

Rule Path
Disallow /

*

Rule Path
Disallow /test
Disallow /statusz
Disallow /googled7d15cf641a3c612.html

Other Records

Field Value
sitemap https://database.earth/sitemap.xml
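
The groups listed above can be checked with Python's standard `urllib.robotparser`. This is a sketch only: the `ROBOTS_TXT` text below is a hypothetical reconstruction from this report (the real file may differ in letter case, ordering, and comments), and `examplebot` is a made-up agent name standing in for any crawler not named in a group. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the scanned file, built from the groups
# in this report. It is not a copy of the real robots.txt.
ROBOTS_TXT = """\
User-agent: semrushbot
User-agent: ahrefsbot
Crawl-delay: 60

User-agent: imagesiftbot
Crawl-delay: 3600

User-agent: amazonbot
User-agent: bytespider
User-agent: gptbot
User-agent: mj12bot
User-agent: dataforseobot
User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /test
Disallow: /statusz
Disallow: /googled7d15cf641a3c612.html

Sitemap: https://database.earth/sitemap.xml
"""

BASE = "https://database.earth"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the parsed rules."""
    return parser.can_fetch(agent, BASE + path)


# Agents in the "Disallow /" group are blocked everywhere.
print("gptbot /        ->", allowed("gptbot", "/"))
# semrushbot has its own group with no rules, so the "*" rules do not apply.
print("semrushbot /test ->", allowed("semrushbot", "/test"))
# An unlisted agent (hypothetical name) falls back to the "*" group.
print("examplebot /test ->", allowed("examplebot", "/test"))
print("examplebot /     ->", allowed("examplebot", "/"))
# Crawl-delay and sitemap records.
print("semrushbot delay ->", parser.crawl_delay("semrushbot"))
print("imagesiftbot delay ->", parser.crawl_delay("imagesiftbot"))
print("sitemaps ->", parser.site_maps())
```

Rules match by path prefix in this parser, so a path such as /testing would also fall under Disallow /test for agents in the * group.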