ipiranoticias.com
robots.txt

Robots Exclusion Standard data for ipiranoticias.com

Resource Scan

Scan Details

Site Domain ipiranoticias.com
Base Domain ipiranoticias.com
Scan Status Ok
Last Scan 2024-11-06T07:27:17+00:00
Next Scan 2024-11-13T07:27:17+00:00

Last Scan

Scanned 2024-11-06T07:27:17+00:00
URL https://ipiranoticias.com/robots.txt
Domain IPs 104.21.10.52, 172.67.131.60, 2606:4700:3033::6815:a34, 2606:4700:3033::ac43:833c
Response IP 172.67.131.60
Found Yes
Hash 06bba99d9f97c79ec94779934ccac2e4cd46d00e042480563fe0ddc42132ac93
SimHash 620098120382

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
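
To illustrate how the two groups above would typically be evaluated, here is a minimal Python sketch. It assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The GROUPS table, select_group, and is_allowed are hypothetical names introduced for this example; they are not part of the scan data or of any library. User-agent matching is simplified to a case-insensitive substring check, and the wildcard characters * and $ in paths are not handled.

```python
# Rules copied from the Groups listing above. The dictionary layout is an
# illustrative assumption, not the format used by the scanner.
GROUPS = {
    "scrapy": [("allow", "/")],
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
}


def select_group(user_agent: str):
    """Return the rules of the first named group whose token occurs in the
    user agent (case-insensitive), falling back to the '*' group."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS.get("*", [])


def is_allowed(user_agent: str, path: str) -> bool:
    """Apply longest-prefix precedence: the longest matching rule decides,
    Allow wins a tie, and a path with no matching rule is allowed."""
    best_len, allowed = -1, True
    for rule, prefix in select_group(user_agent):
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and rule == "allow"):
                best_len, allowed = n, (rule == "allow")
    return allowed


if __name__ == "__main__":
    for ua, path in [
        ("Googlebot", "/wp-admin/"),
        ("Googlebot", "/wp-admin/admin-ajax.php"),
        ("Scrapy/2.11", "/wp-admin/"),
    ]:
        print(ua, path, is_allowed(ua, path))
```

Under these assumptions, a generic crawler is blocked from /wp-admin/ but may still request /wp-admin/admin-ajax.php, while a crawler identifying itself as scrapy is allowed everywhere. Python's standard urllib.robotparser applies rules in file order rather than by longest match, so it may give a different answer for the admin-ajax.php path.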

Other Records

Field Value
sitemap https://ipiranoticias.com/sitemap_index.xml
sitemap https://ipiranoticias.com/sitemap-news.xml