diariotrv.com
robots.txt
Robots Exclusion Standard data for diariotrv.com
Resource Scan
Scan Details
Site Domain | diariotrv.com |
Base Domain | diariotrv.com |
Scan Status | Ok |
Last Scan | 2024-11-02T05:13:51+00:00 |
Next Scan | 2024-11-09T05:13:51+00:00 |
Last Scan
Scanned | 2024-11-02T05:13:51+00:00 |
URL | https://diariotrv.com/robots.txt |
Redirect | https://www.diariotrv.com/robots.txt |
Redirect Domain | www.diariotrv.com |
Redirect Base | diariotrv.com |
Domain IPs | 104.21.57.99, 172.67.162.238, 2606:4700:3036::6815:3963, 2606:4700:3036::ac43:a2ee |
Redirect IPs | 104.21.57.99, 172.67.162.238, 2606:4700:3036::6815:3963, 2606:4700:3036::ac43:a2ee |
Response IP | 172.67.162.238 |
Found | Yes |
Hash | 2685d733419f43e66ca2db8c9db44d545046855e9727c0133c6b50d9d9a9220f |
SimHash | ec204c24e9d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /harming/humans |
Disallow | /ignoring/human/orders |
Disallow | /harm/to/self |
Disallow | /api |
Disallow | /admin |
Other Records
Field | Value |
---|---|
sitemap | https://www.diariotrv.com/sitemap.news.xml.gz |
sitemap | https://www.diariotrv.com/sitemap.xml |