wgci.com
robots.txt

Robots Exclusion Standard data for wgci.com

Resource Scan

Scan Details

Site Domain wgci.com
Base Domain wgci.com
Scan Status Ok
Last Scan2024-11-16T16:25:32+00:00
Next Scan 2024-11-23T16:25:32+00:00

Last Scan

Scanned2024-11-16T16:25:32+00:00
URL https://wgci.com/robots.txt
Redirect https://wgci.iheart.com/robots.txt?pname=wgci.com&sc=dnsredirect
Redirect Domain wgci.iheart.com
Redirect Base iheart.com
Domain IPs 107.22.119.212, 34.225.105.235, 54.85.39.84
Redirect IPs 199.232.210.193, 199.232.214.193
Response IP 199.232.46.193
Found Yes
Hash 3baa04e0a4cdce7405b0bfa8805f13f13ed21f0269ffa89bff2e9de5ac7e1018
SimHash b3035cc6efd5

Groups

mediapartners-google*
*

Rule Path
Disallow /search/*
Disallow /calendar/ajaxcall/
Disallow /static/
Disallow /api/*
Disallow /_debug/*
Disallow /_preview
Disallow /eyes-to-ears/*
Disallow /newsletter/embed/*
Disallow /contact/send*
Disallow /text/*
Disallow /alternate/amp/stats.html

Other Records

Field Value
sitemap https://wgci.iheart.com/sitemap.xml