wgcu.org
robots.txt
Robots Exclusion Standard data for wgcu.org
Resource Scan
Scan Details
Site Domain | wgcu.org |
Base Domain | wgcu.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-08-29T20:18:48+00:00 |
Next Scan | 2025-10-28T20:18:48+00:00 |
Last Successful Scan
Scanned | 2025-07-01T10:47:06+00:00 |
URL | https://wgcu.org/robots.txt |
Redirect | https://www.wgcu.org/robots.txt |
Redirect Domain | www.wgcu.org |
Redirect Base | wgcu.org |
Domain IPs | 13.35.202.101, 13.35.202.124, 13.35.202.64, 13.35.202.88 |
Redirect IPs | 13.35.202.101, 13.35.202.124, 13.35.202.64, 13.35.202.88 |
Response IP | 13.35.202.101 |
Found | Yes |
Hash | f60428df6bcfdc65e2c45da839a737a7db377e7b98d895cb5683415ea87b6f57 |
SimHash | f4844504cc53 |
Groups
*
Rule | Path |
---|---|
Disallow | /login |
Disallow | /all-tv-shows |
Disallow | /live-tv |
Disallow | /profile |
Disallow | /auth** |
Disallow | /shows/** |
Other Records
Field | Value |
---|---|
sitemap | https://news.wgcu.org/sitemap.xml |