news.wgcu.org
robots.txt
Robots Exclusion Standard data for news.wgcu.org
Resource Scan
Scan Details
Site Domain | news.wgcu.org |
Base Domain | wgcu.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-08-23T17:05:29+00:00 |
Next Scan | 2025-09-22T17:05:29+00:00 |
Last Successful Scan
Scanned | 2025-07-02T16:55:47+00:00 |
URL | https://news.wgcu.org/robots.txt |
Domain IPs | 13.35.37.26, 13.35.37.44, 13.35.37.74, 13.35.37.82 |
Response IP | 13.35.37.26 |
Found | Yes |
Hash | f60428df6bcfdc65e2c45da839a737a7db377e7b98d895cb5683415ea87b6f57 |
SimHash | f4844504cc53 |
Groups
*
Rule | Path |
---|---|
Disallow | /login |
Disallow | /all-tv-shows |
Disallow | /live-tv |
Disallow | /profile |
Disallow | /auth** |
Disallow | /shows/** |
Other Records
Field | Value |
---|---|
sitemap | https://news.wgcu.org/sitemap.xml |