Robots Exclusion Standard data for owntv.ca

Resource Scan

Scan Details

Site Domain owntv.ca
Base Domain owntv.ca
Scan Status Ok
Last Scan 2024-09-28T18:50:35+00:00
Next Scan 2024-10-05T18:50:35+00:00

Last Scan

Scanned 2024-09-28T18:50:35+00:00
URL https://owntv.ca/robots.txt
Redirect https://www.oprah.com:443/robots.txt
Redirect Domain www.oprah.com
Redirect Base oprah.com
Domain IPs 18.161.97.10, 18.161.97.32, 18.161.97.47, 18.161.97.75
Redirect IPs 118.215.84.243
Response IP 23.15.106.153
Found Yes
Hash 2e47e6f99c0b000d2234a2c34c68f2bb0c518762d5110060fc59f82a07467bfd
SimHash a9429b125593
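
The report does not say which algorithm produces the Hash field. Its 64 hexadecimal characters are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `fingerprint` is hypothetical, and the scanner may hash a normalized form of the file instead.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a 64-character hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    name the algorithm it actually uses.
    """
    return hashlib.sha256(body).hexdigest()

# Comparing digests from two scans is a cheap way to detect that the file changed.
old = fingerprint(b"User-agent: *\nAllow: /\n")
new = fingerprint(b"User-agent: *\nDisallow: /\n")
changed = old != new
```

An exact digest only reveals that something changed. A similarity hash such as the SimHash field is typically used to judge how much.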

Groups

*

Rule Path
Allow /
Disallow /*preview%3D*
Disallow /search*
Disallow /*print%3D*
Disallow /profile*
Disallow *plug_id*
Disallow *messageID*
Disallow *searchID*
Disallow *pollid*
Disallow *cmd*
Disallow *servlet*
Disallow /app/brene-brown-on-demand-special-offerone.html
Disallow /app/brene-brown-on-demand-special-offertwo.html
Disallow /app/brene-brown-special-offer.html
Disallow /app/arianna-huffington-thrive.html
Disallow /app/thrivespecialoffer.html
Disallow /app/thriveaccenture.html
Disallow /app/thrivehuffingtonpost.html
Disallow /app/thrive-linkedin.html
Disallow /app/thought-industries-*login.html*
Disallow /shared/ad-enablers/*
Disallow /shared/ad-iframe-busting-enablers/*
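
The rules in the * group can be evaluated roughly as follows. This is a minimal sketch that assumes the matching behaviour described in RFC 9309: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. Only a subset of the rules above is included, the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and percent-encoding normalization is not handled.

```python
import re

# A subset of the "*" group listed above, in (rule, path pattern) form.
RULES = [
    ("allow", "/"),
    ("disallow", "/*preview%3D*"),
    ("disallow", "/search*"),
    ("disallow", "/profile*"),
    ("disallow", "*plug_id*"),
    ("disallow", "/shared/ad-enablers/*"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")  # a trailing "$" anchors the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; Allow wins when lengths are equal."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow
```

Under these assumptions, `is_allowed("/index.html")` is true because only `Allow /` matches, while `is_allowed("/search?q=x")` is false because `/search*` is the longer match.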

adsbot-google

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
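
A crawler first picks the group that applies to it and only then evaluates that group's rules. The sketch below assumes case-insensitive substring matching of the group name against the crawler's user-agent string, with `*` as the fallback. The function name `select_group` is hypothetical, the `*` entry is abbreviated, and individual crawlers may apply their own selection rules.

```python
# Groups from the scan above; the "*" group is abbreviated to its first rule.
GROUPS = {
    "*": [("allow", "/")],
    "adsbot-google": [("allow", "/")],
    "gptbot": [("disallow", "/")],
    "chatgpt-user": [("disallow", "/")],
}

def select_group(user_agent, groups=GROUPS):
    """Return the name of the group that applies to a user-agent string.

    Assumption: a named group applies when its name appears anywhere in
    the lower-cased user-agent string; the most specific (longest) name
    wins, and "*" is used when no named group matches.
    """
    token = user_agent.lower()
    matches = [name for name in groups if name != "*" and name in token]
    if matches:
        return max(matches, key=len)
    return "*"
```

With this selection, a crawler identifying as GPTBot or ChatGPT-User lands in a group whose only rule is `Disallow /`, so the whole site is off limits to it, while an unlisted crawler falls back to the * group.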

Other Records

Field Value
sitemap https://www.oprah.com/sitemap.xml.gz