centraldenoticiasacarau.com.br
robots.txt

Robots Exclusion Standard data for centraldenoticiasacarau.com.br

Resource Scan

Scan Details

Site Domain centraldenoticiasacarau.com.br
Base Domain centraldenoticiasacarau.com.br
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-08-21T17:56:02+00:00
Next Scan 2024-11-19T17:56:02+00:00
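
The last and next scan timestamps above are exactly 90 days apart. The report does not state a rescan policy, so reading this as a fixed 90-day interval is an assumption. A minimal sketch of the arithmetic, using only the two timestamps shown:

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-08-21T17:56:02+00:00")
next_scan = datetime.fromisoformat("2024-11-19T17:56:02+00:00")

# Difference between the scheduled next scan and the last (failed) scan.
interval = next_scan - last_scan
print(interval.days)
```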

Last Successful Scan

Scanned 2024-01-25T17:01:51+00:00
URL http://www.centraldenoticiasacarau.com.br/robots.txt
Domain IPs 2404:6800:4003:c04::79, 74.125.200.121
Response IP 74.125.200.121
Found Yes
Hash 112c7b851128825a344566b00220b6dcfe4348c93f4b81b78c52e5f3dac0e6ad
SimHash 0b0496404f93

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap http://www.centraldenoticiasacarau.com.br/sitemap.xml
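
The groups and sitemap record above can be checked against specific URLs with Python's standard `urllib.robotparser`. The sketch below is illustrative only: the `ROBOTS_TXT` text is a reconstruction from the parsed groups in this report, not the file as served, and `ExampleBot` is a hypothetical crawler name. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the Groups and Other Records
# sections; the original formatting and casing are not known.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /search
Allow: /

Sitemap: http://www.centraldenoticiasacarau.com.br/sitemap.xml
"""

BASE = "http://www.centraldenoticiasacarau.com.br"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


# An empty Disallow places no restriction on Mediapartners-Google.
print(allowed("Mediapartners-Google", "/search"))
# Every other agent (here the hypothetical ExampleBot) falls under "*".
print(allowed("ExampleBot", "/search"))
print(allowed("ExampleBot", "/"))
print(parser.site_maps())
```

Note that `urllib.robotparser` applies rules in file order, while RFC 9309 specifies longest-match precedence. For this rule set both approaches agree: `/search` is blocked for agents in the `*` group and everything else is allowed.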