rdnews.com.br
robots.txt
Robots Exclusion Standard data for rdnews.com.br
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | rdnews.com.br |
Base Domain | rdnews.com.br |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-08-22T17:17:54+00:00 |
Next Scan | 2024-11-20T17:17:54+00:00 |
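The Last Scan and Next Scan timestamps are ISO 8601 values with a UTC offset, so the gap between them can be computed directly. The sketch below uses only the two values reported above. The variable names are illustrative, and the report does not say how the next scan date is chosen.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-08-22T17:17:54+00:00")
next_scan = datetime.fromisoformat("2024-11-20T17:17:54+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
scan_interval = next_scan - last_scan

print(f"Days until next scan: {scan_interval.days}")
```

For these two values the interval is exactly 90 days.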
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-04-25T13:50:00+00:00 |
URL | https://rdnews.com.br/robots.txt |
Redirect | https://www.rdnews.com.br/robots.txt |
Redirect Domain | www.rdnews.com.br |
Redirect Base | rdnews.com.br |
Domain IPs | 104.21.26.37, 172.67.135.89, 2606:4700:3032::ac43:8759, 2606:4700:3036::6815:1a25 |
Redirect IPs | 104.21.26.37, 172.67.135.89, 2606:4700:3032::ac43:8759, 2606:4700:3036::6815:1a25 |
Response IP | 172.67.135.89 |
Found | Yes |
Hash | ce0407218c9181db5bdb60b8f093a200bc3773b97470072896d8fd724d16cf25 |
SimHash | 63417e732be7 |
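The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below is an assumption: it shows how a fetched robots.txt body could be fingerprinted with SHA-256 and compared against a stored value to detect changes. The `sha256_hex` helper and the `sample_body` bytes are hypothetical and are not the actual file content.

```python
import hashlib

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

# Value copied from the Hash row of the report.
reported_hash = "ce0407218c9181db5bdb60b8f093a200bc3773b97470072896d8fd724d16cf25"

# Hypothetical stand-in for a fetched robots.txt body (NOT the real file).
sample_body = b"User-agent: *\nDisallow: /includes/\n"
sample_hash = sha256_hex(sample_body)

# A mismatch between stored and fresh digests would signal that the file changed.
has_changed = sample_hash != reported_hash
print(len(reported_hash), len(sample_hash), has_changed)
```

Comparing digests rather than full bodies keeps change detection cheap. Unlike the SimHash value, an exact digest says nothing about how similar two versions are.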
Groups
*
Rule | Path |
---|---|
Disallow | /includes/ |
Disallow | /imprime.php |
Disallow | /curtinhas.php?pageNum_Pagina= |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.rdnews.com.br/storage/sitemaps/sitemap-index.xml |
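The group rules and records listed above can be approximated as a robots.txt file and checked with Python's standard `urllib.robotparser` module. This is a sketch under stated assumptions: the file text below is reconstructed from the parsed report, not copied from the server (the scan also reports invalid lines that are not shown), and `ExampleBot` is a hypothetical user agent name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's Groups and Other Records tables (an assumption,
# not the original file, which also contained lines the scanner flagged as invalid).
ROBOTS_TXT = """\
User-agent: *
Disallow: /includes/
Disallow: /imprime.php
Disallow: /curtinhas.php?pageNum_Pagina=
Crawl-delay: 10

Sitemap: https://www.rdnews.com.br/storage/sitemaps/sitemap-index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

AGENT = "ExampleBot"  # hypothetical crawler name; it falls under the "*" group
BASE = "https://www.rdnews.com.br"

# Check a few paths against the "*" group.
home_allowed = parser.can_fetch(AGENT, BASE + "/")
includes_allowed = parser.can_fetch(AGENT, BASE + "/includes/header.php")
print_allowed = parser.can_fetch(AGENT, BASE + "/imprime.php")
paged_allowed = parser.can_fetch(AGENT, BASE + "/curtinhas.php?pageNum_Pagina=2")

# Non-rule records: crawl delay in seconds and the declared sitemap URLs.
delay = parser.crawl_delay(AGENT)
sitemaps = parser.site_maps()  # available in Python 3.8+

print(home_allowed, includes_allowed, print_allowed, paged_allowed, delay, sitemaps)
```

Under this reconstruction, the home page is fetchable while the three disallowed prefixes are blocked, and a polite crawler would wait the declared 10 seconds between requests.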
Warnings
- 8 invalid lines.