jornalhoraextra.com.br
robots.txt

Robots Exclusion Standard data for jornalhoraextra.com.br

Resource Scan

Scan Details

Site Domain jornalhoraextra.com.br
Base Domain jornalhoraextra.com.br
Scan Status Ok
Last Scan 2024-11-16T04:28:09+00:00
Next Scan 2024-11-23T04:28:09+00:00

Last Scan

Scanned 2024-11-16T04:28:09+00:00
URL https://jornalhoraextra.com.br/robots.txt
Domain IPs 67.23.238.44
Response IP 67.23.238.44
Found Yes
Hash 38cce1056d58bbf66446502aec99637ca7624b3cf6a5d8ebfc2cbde58370edaa
SimHash 69645c448891

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /*.html$
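
The third rule uses two pattern characters that many crawlers recognise: `*` matches any run of characters, and a trailing `$` anchors the match to the end of the URL. The following is a minimal Python sketch of how a crawler might evaluate the three Disallow rules above. The helper names `robots_pattern_to_regex` and `is_disallowed` and the sample paths are illustrative assumptions, not part of the scan data, and Allow rules and longest-match precedence are left out.

```python
import re

# Disallow paths listed for the "*" group in the scan above.
DISALLOW = ["/wp-admin/", "/wp-includes/", "/*.html$"]


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters and a trailing '$' anchors the
    end of the URL. Everything else is treated as a literal prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(robots_pattern_to_regex(p).match(path) for p in DISALLOW)


# Hypothetical paths, chosen only to exercise each rule.
print(is_disallowed("/wp-admin/options.php"))          # True
print(is_disallowed("/noticias/materia.html"))         # True
print(is_disallowed("/noticias/materia.html?page=2"))  # False
print(is_disallowed("/noticias/"))                     # False
```

Because of the `$` anchor, a URL that ends in `.html` is blocked, while the same URL followed by a query string is not.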

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://www.jornalhoraextra.com.br/sitemap.xml
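
The crawl-delay and sitemap records can be read back with Python's standard `urllib.robotparser`. This is a sketch only: the `ROBOTS_TXT` text below is reconstructed from the records listed in this report and is an assumption. The file actually served may differ in ordering, comments, or whitespace.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the records in this report;
# not a verbatim copy of the fetched robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /*.html$
Crawl-delay: 30

Sitemap: https://www.jornalhoraextra.com.br/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds that the "*" group asks crawlers to wait between requests.
delay = parser.crawl_delay("*")

# Sitemap URLs declared anywhere in the file (requires Python 3.8+).
sitemaps = parser.site_maps()

print(delay)     # 30
print(sitemaps)  # ['https://www.jornalhoraextra.com.br/sitemap.xml']
```

Note that `urllib.robotparser` compares rule paths as plain prefixes and does not expand `*` or `$`, so its `can_fetch` result for the `/*.html$` rule should not be relied on.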