Robots Exclusion Standard data for gtoaa.org
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | gtoaa.org |
| Base Domain | gtoaa.org |
| Scan Status | Ok |
| Last Scan | 2025-11-21T20:00:45+00:00 |
| Next Scan | 2025-12-21T20:00:45+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-21T20:00:45+00:00 |
| URL | https://gtoaa.org/robots.txt |
| Domain IPs | 104.21.1.190, 172.67.129.218 |
| Response IP | 104.21.1.190 |
| Found | Yes |
| Hash | bf2db6ec0a3910d0dfb47e3f183939e7152c1f5bfcb592ce1b8b4b7b438391d1 |
| SimHash | 24200f43d1c4 |
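The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a minimal sketch of how such a fingerprint could be computed and compared between scans looks like this. The helper name `content_hash` is hypothetical and not part of any scanner API.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the report's "Hash" field is SHA-256. The report itself
    does not say which algorithm produced it.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Report whether the freshly fetched body differs from the last scan."""
    return content_hash(body) != previous_hash
```

An exact hash only signals that some byte changed. The much shorter SimHash field is presumably a similarity fingerprint meant for judging how large a change is, which this sketch does not attempt to reproduce.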
Groups
ai2bot
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
claudebot
cohere-ai
cohere-training-data-crawler
duckassistbot
facebookbot
google-extended
googleother
googleother-image
googleother-video
gptbot
img2dataset
meta-externalagent
mycentralaiscraperbot
omgili
omgilibot
quora-bot
tiktokspider
youbot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
*
| Rule | Path |
|---|---|
| Disallow | /config |
| Disallow | /search |
| Disallow | /account$ |
| Disallow | /account/ |
| Disallow | /commerce/digital-download/ |
| Disallow | /api/ |
| Allow | /api/ui-extensions/ |
| Disallow | /static/ |
| Disallow | /*?author=* |
| Disallow | /*%26author%3D* |
| Disallow | /*?tag=* |
| Disallow | /*%26tag%3D* |
| Disallow | /*?month=* |
| Disallow | /*%26month%3D* |
| Disallow | /*?view=* |
| Disallow | /*%26view%3D* |
| Disallow | /*?format=json |
| Disallow | /*%26format%3Djson |
| Disallow | /*?format=page-context |
| Disallow | /*%26format%3Dpage-context |
| Disallow | /*?format=main-content |
| Disallow | /*%26format%3Dmain-content |
| Disallow | /*?format=json-pretty |
| Disallow | /*%26format%3Djson-pretty |
| Disallow | /*?format=ical |
| Disallow | /*%26format%3Dical |
| Disallow | /*?reversePaginate=* |
| Disallow | /*%26reversePaginate%3D* |
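The paths in the table above use two special characters: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL path. The sketch below shows one way to evaluate such rules, assuming the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). `RULES` is a hand-copied subset of the table, and `pattern_to_regex` and `is_allowed` are illustrative names. The sketch does not normalize percent-encoding, so the `%26...%3D` variants are left out.

```python
import re

# Hand-copied subset of the rule table, as (rule, path pattern) pairs.
RULES = [
    ("Disallow", "/config"),
    ("Disallow", "/account$"),
    ("Disallow", "/account/"),
    ("Disallow", "/api/"),
    ("Allow", "/api/ui-extensions/"),
    ("Disallow", "/*?author=*"),
    ("Disallow", "/*?format=json"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path (including any query string) may be crawled."""
    best_len = -1
    allowed = True  # No matching rule means the path is allowed.
    for rule, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = rule.lower() == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and is_allow
            if longer or tie_for_allow:
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

With these rules, `/account` is blocked by `/account$` while `/accounting` is not, and `/api/ui-extensions/x` stays crawlable because the longer Allow pattern overrides `Disallow /api/`.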
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.gtoaa.org/sitemap.xml |