iheart.com
robots.txt

Robots Exclusion Standard data for iheart.com

Resource Scan

Scan Details

Site Domain iheart.com
Base Domain iheart.com
Scan Status Ok
Last Scan2024-12-20T05:41:29+00:00
Next Scan 2024-12-27T05:41:29+00:00

Last Scan

Scanned2024-12-20T05:41:29+00:00
URL https://iheart.com/robots.txt
Redirect https://www.iheart.com/robots.txt
Redirect Domain www.iheart.com
Redirect Base iheart.com
Domain IPs 151.101.0.65, 151.101.128.65, 151.101.192.65, 151.101.64.65
Redirect IPs 199.232.210.84, 199.232.214.84
Response IP 199.232.46.84
Found Yes
Hash 79534b6a5c55c28e3cc9403703d8bb5e79e1d203f5986f827034015b532e1895
SimHash 690a514564b1

Groups

mediapartners-google

Rule Path
Allow /ads.txt

facebookexternalhit

Rule Path
Disallow /_/
Disallow /a/
Disallow /misc/

*

Rule Path
Disallow /_/
Disallow /a/
Disallow /amp/
Disallow /misc/
Disallow /search/
Disallow /*/itunes/
Disallow /*/amazon/
Disallow /login/
Disallow /userInfo/
Disallow /family-validation/

*
googlebot-news

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.iheart.com/sitemap.xml
sitemap https://www.iheart.com/news_sitemap.xml

Comments

  • Production