afnoticias.com.br
robots.txt

Robots Exclusion Standard data for afnoticias.com.br

Resource Scan

Scan Details

Site Domain afnoticias.com.br
Base Domain afnoticias.com.br
Scan Status Ok
Last Scan2024-09-21T21:32:00+00:00
Next Scan 2024-09-28T21:32:00+00:00

Last Scan

Scanned2024-09-21T21:32:00+00:00
URL https://afnoticias.com.br/robots.txt
Domain IPs 96.126.108.196
Response IP 96.126.108.196
Found Yes
Hash a53e83947f8c5456e3c8685be656b6c16194fb62ec284890f1e3ac7d77498ca3
SimHash e9004ac6fe93

Groups

*

Rule Path
Disallow /banner
Allow /

proximic

Rule Path
Disallow /

starkbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

voluumdsp-content-bot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

*

Rule Path
Disallow /busca?query=*

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
sitemap https://afnoticias.com.br/sitemap.xml

Warnings

  • 1 invalid line.