news.wgcu.org
robots.txt

Robots Exclusion Standard data for news.wgcu.org

Resource Scan

Scan Details

Site Domain news.wgcu.org
Base Domain wgcu.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-23T17:05:29+00:00
Next Scan 2025-09-22T17:05:29+00:00
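
The Last Scan and Next Scan timestamps in the details above can be compared directly. The sketch below only does that arithmetic on the two values shown; that the interval reflects a fixed rescan schedule is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows.
last_scan = datetime.fromisoformat("2025-08-23T17:05:29+00:00")
next_scan = datetime.fromisoformat("2025-09-22T17:05:29+00:00")

# Difference between the scheduled next scan and the last attempt.
interval = next_scan - last_scan
print(interval.days)  # number of whole days between the two scans
```

For these two values the gap is exactly 30 days, with no leftover seconds.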

Last Successful Scan

Scanned 2025-07-02T16:55:47+00:00
URL https://news.wgcu.org/robots.txt
Domain IPs 13.35.37.26, 13.35.37.44, 13.35.37.74, 13.35.37.82
Response IP 13.35.37.26
Found Yes
Hash f60428df6bcfdc65e2c45da839a737a7db377e7b98d895cb5683415ea87b6f57
SimHash f4844504cc53

Groups

*

Rule Path
Disallow /login
Disallow /all-tv-shows
Disallow /live-tv
Disallow /profile
Disallow /auth**
Disallow /shows/**
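
To see how the rules in this group apply to concrete paths, here is a minimal sketch of a matcher. It assumes the common wildcard interpretation (`*` matches any run of characters, a trailing `$` anchors the end of the path, everything else is a prefix match); individual crawlers may interpret patterns such as `/auth**` differently. The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data.

```python
import re

# Disallow rules copied from the "*" group above.
DISALLOW_RULES = [
    "/login",
    "/all-tv-shows",
    "/live-tv",
    "/profile",
    "/auth**",
    "/shows/**",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the end of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal segments and join them with ".*" for each wildcard.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any Disallow rule matches the given URL path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

for path in ["/login", "/auth/callback", "/shows/example", "/shows", "/news/story"]:
    print(path, is_disallowed(path))
```

Because the group contains no Allow rules, the sketch does not need longest-match precedence; a file that mixes Allow and Disallow would.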

Other Records

Field Value
sitemap https://news.wgcu.org/sitemap.xml
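
The group and sitemap record above can be turned back into robots.txt syntax. The sketch below is an approximation only: the original file's line order, whitespace, and any comments are not recorded in this report, so the output is not expected to reproduce the hash listed above. `GROUPS`, `SITEMAPS`, and `render_robots_txt` are hypothetical names.

```python
# Parsed data copied from the Groups and Other Records sections.
GROUPS = {
    "*": ["/login", "/all-tv-shows", "/live-tv", "/profile", "/auth**", "/shows/**"],
}
SITEMAPS = ["https://news.wgcu.org/sitemap.xml"]

def render_robots_txt(groups: dict, sitemaps: list) -> str:
    """Render user-agent groups and sitemap records as robots.txt text."""
    lines = []
    for agent, disallows in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in disallows)
        lines.append("")  # blank line separates groups
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"

text = render_robots_txt(GROUPS, SITEMAPS)
print(text)
```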