terradotta.com
robots.txt

Robots Exclusion Standard data for terradotta.com

Resource Scan

Scan Details

Site Domain terradotta.com
Base Domain terradotta.com
Scan Status Ok
Last Scan2024-09-04T03:35:21+00:00
Next Scan 2024-10-04T03:35:21+00:00

Last Scan

Scanned2024-09-04T03:35:21+00:00
URL https://terradotta.com/robots.txt
Redirect https://www.terradotta.com/robots.txt
Redirect Domain www.terradotta.com
Redirect Base terradotta.com
Domain IPs 35.165.100.40
Redirect IPs 35.165.100.40
Response IP 35.165.100.40
Found Yes
Hash ccdc3176ff993975d7cfa30175655259fc16d13c8cb71cc7e34b5ab477682fa6
SimHash 2a3c9b52c7d3

Groups

*

Rule Path
Disallow /rservice/
Disallow /27panels/
Disallow /backup/
Disallow /_thumbs/
Disallow /errors/
Disallow /*.php$

baiduspider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.terradotta.com/sitemap.xml

Comments

  • robots.txt file for this website
  • addresses all robots by using wild card *
  • User-agent: *
  • list folders robots are not allowed to index
  • End of robots.txt file