oprah.com
robots.txt

Robots Exclusion Standard data for oprah.com

Resource Scan

Scan Details

Site Domain oprah.com
Base Domain oprah.com
Scan Status Ok
Last Scan2024-06-22T03:02:36+00:00
Next Scan 2024-06-29T03:02:36+00:00

Last Scan

Scanned2024-06-22T03:02:36+00:00
URL https://oprah.com/robots.txt
Redirect https://www.oprah.com:443/robots.txt
Redirect Domain www.oprah.com
Redirect Base oprah.com
Domain IPs 18.208.61.196, 34.194.86.89
Redirect IPs 104.69.171.192
Response IP 104.76.132.84
Found Yes
Hash 2e47e6f99c0b000d2234a2c34c68f2bb0c518762d5110060fc59f82a07467bfd
SimHash a9429b125593

Groups

*

Rule Path
Allow /
Disallow /*preview%3D*
Disallow /search*
Disallow /*print%3D*
Disallow /profile*
Disallow *plug_id*
Disallow *messageID*
Disallow *searchID*
Disallow *pollid*
Disallow *cmd*
Disallow *servlet*
Disallow /app/brene-brown-on-demand-special-offerone.html
Disallow /app/brene-brown-on-demand-special-offertwo.html
Disallow /app/brene-brown-special-offer.html
Disallow /app/arianna-huffington-thrive.html
Disallow /app/thrivespecialoffer.html
Disallow /app/thriveaccenture.html
Disallow /app/thrivehuffingtonpost.html
Disallow /app/thrive-linkedin.html
Disallow /app/thought-industries-*login.html*
Disallow /shared/ad-enablers/*
Disallow /shared/ad-iframe-busting-enablers/*

adsbot-google

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.oprah.com/sitemap.xml.gz