Robots Exclusion Standard data for ato.org

Resource Scan

Scan Details

Site Domain ato.org
Base Domain ato.org
Scan Status Ok
Last Scan 2025-08-31T04:02:09+00:00
Next Scan 2025-09-30T04:02:09+00:00

Last Scan

Scanned 2025-08-31T04:02:09+00:00
URL https://ato.org/robots.txt
Domain IPs 104.18.185.50
Response IP 104.18.185.50
Found Yes
Hash 81ed58493ab76ff05df0b88bd00bb6c5003df0e0b3c6da2454610f9acf4fef25
SimHash 436e4842e623

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-login.php
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /all-posts
Disallow /trackback
Disallow /comments
Disallow */trackback
Disallow */comments
Allow /wp-admin/admin-ajax.php
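
The "*" group mixes Disallow and Allow rules and uses a leading "*" wildcard, so the outcome for a given path depends on how a crawler resolves overlaps. The following is a minimal sketch, assuming the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The scan data does not say which matcher any particular crawler uses, and the names STAR_GROUP, pattern_to_regex, and is_allowed are hypothetical, chosen only for illustration.

```python
import re

# Rules copied from the "*" group above, in file order.
STAR_GROUP = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/cache/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/all-posts"),
    ("disallow", "/trackback"),
    ("disallow", "/comments"),
    ("disallow", "*/trackback"),
    ("disallow", "*/comments"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=STAR_GROUP) -> bool:
    """Return True if `path` may be crawled under `rules` (assumed RFC 9309 semantics)."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longest pattern wins; on equal length, prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


print(is_allowed("/wp-admin/admin-ajax.php"))  # True: the Allow rule is longer
print(is_allowed("/wp-admin/options.php"))     # False
print(is_allowed("/blog/post/trackback"))      # False: matched by */trackback
print(is_allowed("/about"))                    # True: no rule matches
```

Under this assumption, /wp-admin/admin-ajax.php stays crawlable even though it sits beneath the disallowed /wp-admin/ prefix. A matcher that applies rules in file order and stops at the first hit would give the opposite answer for that path.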

bytespider

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

geedoproductsearch

Rule Path
Disallow /
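
Each named group above carries a single "Disallow /" rule, so for those user agents the whole site is off limits, while every other crawler falls back to the "*" group. The following is a minimal sketch of that group selection, assuming a simple case-insensitive exact comparison against the user-agent values listed in this scan. Real crawlers may match their product token differently, and the names FULLY_BLOCKED, select_group, and is_fully_blocked are hypothetical.

```python
# User-agent values from the groups above that have "Disallow /" as their only rule.
FULLY_BLOCKED = {
    "bytespider",
    "bytedance",
    "coccocbot-web",
    "dotbot",
    "blexbot",
    "ccbot",
    "geedobot",
    "netestate ne crawler (+http://www.website-datenbank.de/)",
    "geedoproductsearch",
}


def select_group(product_token: str) -> str:
    """Return the name of the group that applies to a crawler.

    Assumption: a case-insensitive exact match against the listed
    user-agent values; anything else falls back to the "*" group.
    """
    token = product_token.strip().lower()
    return token if token in FULLY_BLOCKED else "*"


def is_fully_blocked(product_token: str) -> bool:
    """True if the crawler's own group disallows the entire site."""
    return select_group(product_token) != "*"


print(select_group("CCBot"))           # ccbot
print(select_group("Googlebot"))       # *
print(is_fully_blocked("Bytespider"))  # True
```

A crawler that lands in the "*" group is still subject to that group's path rules; only the named agents are excluded from the site as a whole.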

Other Records

Field Value
sitemap https://ato.org/sitemap_index.xml

Comments

  • To block Bytespider from crawling:
  • To block Bytedance from crawling:
  • To block Coccocbot from crawling:
  • To block Dotbot from crawling:
  • To block Common Crawl Bot from crawling:
  • To block Common Crawl Bot from crawling:
  • To block GeedoBot from crawling:
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • To block GeedoBot from crawling: