mainepublic.org
robots.txt

Robots Exclusion Standard data for mainepublic.org

Resource Scan

Scan Details

Site Domain mainepublic.org
Base Domain mainepublic.org
Scan Status Ok
Last Scan2024-06-21T19:57:49+00:00
Next Scan 2024-07-21T19:57:49+00:00

Last Scan

Scanned2024-06-21T19:57:49+00:00
URL https://www.mainepublic.org/robots.txt
Domain IPs 13.33.30.101, 13.33.30.115, 13.33.30.15, 13.33.30.93
Response IP 13.33.30.115
Found Yes
Hash 3072de4892a8fdde1e0f2b7b40849f377a2b2095386caab0621c39f704746179
SimHash 7d448806ae96

Groups

*

Rule Path
Disallow /clip/*
Disallow /auth*
Disallow /shows*
Disallow /login
Disallow /all-tv-shows
Disallow /profile

Other Records

Field Value
sitemap https://www.mainepublic.org/sitemap.xml
sitemap https://www.mainepublic.org/sitemap-latest.xml
sitemap https://www.mainepublic.org/news-sitemap-content.xml