presscorp.org
robots.txt

Robots Exclusion Standard data for presscorp.org

Resource Scan

Scan Details

Site Domain presscorp.org
Base Domain presscorp.org
Scan Status Ok
Last Scan 2024-11-13T22:53:49+00:00
Next Scan 2024-11-20T22:53:49+00:00
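
The two scan timestamps above are exactly seven days apart, which is consistent with a weekly rescan schedule, although the report does not state the interval. A minimal sketch of that arithmetic, using only the two values listed:

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing above.
last_scan = datetime.fromisoformat("2024-11-13T22:53:49+00:00")
next_scan = datetime.fromisoformat("2024-11-20T22:53:49+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```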

Last Scan

Scanned 2024-11-13T22:53:49+00:00
URL https://presscorp.org/robots.txt
Domain IPs 192.107.243.128, 2602:ff1c:1:120::5
Response IP 192.107.243.128
Found Yes
Hash c7ab3520ee66f39c3bf8fe1928c1eeecffa52fde331c951ac7abce06a440406a
SimHash e900cc08ebd2

Groups

*

Rule Path
Disallow /categories/cat-*
Disallow /index.rss
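
The two Disallow rules above can be checked against a request path with a small matcher. The sketch below assumes the common robots.txt wildcard convention, where `*` matches any run of characters, a trailing `$` anchors the end of the path, and a pattern otherwise matches as a prefix. The function names and the sample paths are hypothetical and are not taken from the scanned site. It uses a hand-written matcher because Python's `urllib.robotparser` compares rule paths as plain prefixes and does not expand `*`.

```python
import re

# Rules copied from the "*" group in the report above.
DISALLOW_PATTERNS = ["/categories/cat-*", "/index.rss"]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if the path matches any Disallow pattern.

    The group has no Allow rules, so any match means the path is blocked.
    re.match anchors at the start of the path, giving prefix matching.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW_PATTERNS)


# Hypothetical paths, for illustration only.
for sample in ["/categories/cat-news", "/index.rss", "/categories/", "/about"]:
    print(sample, is_disallowed(sample))
```

Under these assumptions, the first two sample paths are reported as disallowed and the last two as allowed.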

Other Records

Field Value
sitemap http://presscorp.org/sitemap.xml