atlantico.net
robots.txt

Robots Exclusion Standard data for atlantico.net

Resource Scan

Scan Details

Site Domain atlantico.net
Base Domain atlantico.net
Scan Status Ok
Last Scan 2024-09-15T05:08:15+00:00
Next Scan 2024-09-22T05:08:15+00:00

Last Scan

Scanned 2024-09-15T05:08:15+00:00
URL https://atlantico.net/robots.txt
Redirect https://www.atlantico.net/robots.txt
Redirect Domain www.atlantico.net
Redirect Base atlantico.net
Domain IPs 104.26.14.189, 104.26.15.189, 172.67.69.45, 2606:4700:20::681a:ebd, 2606:4700:20::681a:fbd, 2606:4700:20::ac43:452d
Redirect IPs 104.26.14.189, 104.26.15.189, 172.67.69.45, 2606:4700:20::681a:ebd, 2606:4700:20::681a:fbd, 2606:4700:20::ac43:452d
Response IP 172.67.69.45
Found Yes
Hash 4da491ef724f59fe45af56814d7470374ecf2aa35a973cc2924f8d924c545f29
SimHash 4140d950ca97

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin
Allow /tags/*elecciones-municipales*
Disallow /tag
Disallow /archive
Disallow /content/stats

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /
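The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT string is a partial reconstruction from the listed groups, so its layout is an assumption rather than the file as served; "ExampleBot" is a hypothetical crawler name, and the wildcard Allow rule is left out because the standard-library parser may not interpret "*" inside paths.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned rules (assumed layout, not the
# file as served). The wildcard Allow rule is intentionally omitted.
ROBOTS_TXT = """\
User-agent: *
Disallow: /api
Disallow: /admin
Disallow: /archive
Disallow: /content/stats

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical agent that falls under the "*" group.
generic_home = parser.can_fetch("ExampleBot", "https://www.atlantico.net/")
generic_admin = parser.can_fetch("ExampleBot", "https://www.atlantico.net/admin")

# gptbot has its own group that disallows the whole site.
gptbot_home = parser.can_fetch("gptbot", "https://www.atlantico.net/")

print(generic_home, generic_admin, gptbot_home)
```

Under these assumptions a generic crawler may fetch the home page but not /admin, while gptbot is refused everywhere. Other robots.txt parsers can resolve wildcard and precedence rules differently, so results for paths such as /tags/ should be verified against the crawler actually in use.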

Other Records

Field Value
sitemap https://www.atlantico.net/sitemap.news.xml.gz
sitemap https://www.atlantico.net/sitemap.xml
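The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the records appear in the file as ordinary "Sitemap:" lines; the surrounding group is included only to make the input a plausible fragment.

```python
from urllib.robotparser import RobotFileParser

# Assumed fragment of the file; only the two sitemap URLs come from the scan.
lines = [
    "User-agent: *",
    "Disallow: /api",
    "Sitemap: https://www.atlantico.net/sitemap.news.xml.gz",
    "Sitemap: https://www.atlantico.net/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns the Sitemap values in file order, or None if absent.
sitemaps = parser.site_maps()
print(sitemaps)
```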

Comments

  • disallowed AI agents 2024-07