pioneerindiya.com
robots.txt

Robots Exclusion Standard data for pioneerindiya.com

Resource Scan

Scan Details

Site Domain pioneerindiya.com
Base Domain pioneerindiya.com
Scan Status Ok
Last Scan2024-09-25T16:02:35+00:00
Next Scan 2024-10-02T16:02:35+00:00

Last Scan

Scanned2024-09-25T16:02:35+00:00
URL https://pioneerindiya.com/robots.txt
Domain IPs 23.213.158.4, 23.213.158.8
Response IP 23.213.158.8
Found Yes
Hash 40d0614fcfc009c30039d0f67e5b3642d03d726ce7f9e2cd5d831fd7e9d93308
SimHash 2901cb72c2d0

Groups

*

Rule Path
Allow /

adsbot-google

Rule Path
Disallow /preview?article

*

Rule Path
Disallow */can/evnt/click*

*

Rule Path
Disallow *?col_ci=*

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://pioneerindiya.com/sitemap.xml
sitemap https://pioneerindiya.com/news-sitemap.xml
sitemap https://pioneerindiya.com/sitemap_index.xml
sitemap https://pioneerindiya.com/category-sitemap.xml
sitemap https://pioneerindiya.com/image-sitemap.xml