oklahoman.com
robots.txt

Robots Exclusion Standard data for oklahoman.com

Resource Scan

Scan Details

Site Domain oklahoman.com
Base Domain oklahoman.com
Scan Status Ok
Last Scan2024-05-14T06:18:10+00:00
Next Scan 2024-05-21T06:18:10+00:00

Last Scan

Scanned2024-05-14T06:18:10+00:00
URL https://oklahoman.com/robots.txt
Redirect https://www.oklahoman.com/robots.txt
Redirect Domain www.oklahoman.com
Redirect Base oklahoman.com
Domain IPs 151.101.202.62
Redirect IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Response IP 199.232.46.62
Found Yes
Hash bd9d293365eca0d29645cea750fa63c89fc824e120a02a3bddbdd9726e3f5b81
SimHash 6b9e67c7c4c3

Groups

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /story/sponsor-story/
Disallow /picture-gallery/sponsor-story/
Disallow /videos/sponsor-story/
Disallow /longform/sponsor-story/
Disallow /pages/interactives/sponsor-story/
Disallow /interactives/sponsor-story/
Disallow /videos/embed/
Disallow /cgi-bin/
Disallow /block/
Disallow /iphone/
Disallow /ipad/
Disallow /text/
Disallow /xml/
Disallow /5173/
Disallow /flashmediaelement.swf
Disallow /keysearch
Disallow /topic
Disallow /print.php
Disallow /tracking.php

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /errors
Disallow /interactive/
Disallow /userauth/
Disallow /ugc/
Disallow /feeds/
Disallow /services/
Disallow /facebook/
Disallow /version-info/
Disallow /longform/draft/
Disallow /story/draft/
Disallow /topic/*/smart/
Disallow /search
Disallow /module-showcase/
Disallow /newsletter/
Disallow /blended-newsletter/
Disallow /story/nletter/
Disallow /sports/services/photos/
Disallow /optimus
Disallow /ux-train
Disallow /story/advisory/
Disallow /.cam-tangent/
Disallow /pbd/
Disallow /gciaf/

googlebot
googlebot-image
googlebot-video
googlebot-mobile
mediapartners-google
bingbot
twitterbot
facebot
facebookexternalhit
yahoo! slurp
twitterbot
ia_archiver

Rule Path
Disallow /webapi/
Disallow /cgi-bin/
Disallow /block/
Disallow /iphone/
Disallow /ipad/
Disallow /text/
Disallow /xml/
Disallow /5173/
Disallow /flashmediaelement.swf
Disallow /keysearch
Disallow /topic
Disallow /print.php
Disallow /tracking.php

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.oklahoman.com/news-sitemap.xml
sitemap https://www.oklahoman.com/web-sitemap-index.xml
sitemap https://www.oklahoman.com/video-sitemap-index.xml
sitemap https://www.oklahoman.com/sitemap.xml
sitemap https://www.oklahoman.com/sitemaps/newsmap.xml

Comments

  • robots.txt file for https://www.oklahoman.com/