worldatlas.com
robots.txt

Robots Exclusion Standard data for worldatlas.com

Resource Scan

Scan Details

Site Domain worldatlas.com
Base Domain worldatlas.com
Scan Status Ok
Last Scan2024-11-02T10:10:39+00:00
Next Scan 2024-11-09T10:10:39+00:00

Last Scan

Scanned2024-11-02T10:10:39+00:00
URL https://worldatlas.com/robots.txt
Redirect https://www.worldatlas.com:443/robots.txt
Redirect Domain www.worldatlas.com
Redirect Base worldatlas.com
Domain IPs 44.205.57.61, 44.223.162.73
Redirect IPs 44.205.57.61, 44.223.162.73
Response IP 44.205.57.61
Found Yes
Hash 4a582d662459fb4ebebd634d84cf0b920abccd4a17da92d330551bcde460b3a1
SimHash 6c483843c7e3

Groups

*

Rule Path
Disallow /search?q=*
Disallow /webimage/countrys/namerica/usstates/printpage/
Disallow /webimage/countrys/samerica/printpage/
Disallow /webimage/countrys/namerica/camerica/printpage/
Disallow /aatlas/infopage/printpage/
Disallow /webimage/countrys/printpage/
Disallow /webimage/countrys/namerica/province/printpage/
Disallow /webimage/countrys/oceania/printpage/
Disallow /webimage/countrys/polar/printpage/
Disallow /webimage/countrys/namerica/caribb/printpage/
Disallow /aatlas/printpage/
Disallow /webimage/countrys/africa/printpage/
Disallow /webimage/countrys/asia/printpage/
Disallow /webimage/countrys/europe/printpage/
Disallow /webimage/countrys/namerica/printpage/

googlebot

Rule Path
Disallow /tfBuster.html

Other Records

Field Value
sitemap https://www.worldatlas.com/webmap/sitemap-index.xml