ouqprint.com
robots.txt

Robots Exclusion Standard data for ouqprint.com

Resource Scan

Scan Details

Site Domain ouqprint.com
Base Domain ouqprint.com
Scan Status Ok
Last Scan2024-11-12T18:29:13+00:00
Next Scan 2024-11-19T18:29:13+00:00

Last Scan

Scanned2024-11-12T18:29:13+00:00
URL https://ouqprint.com/robots.txt
Domain IPs 2a02:4780:84:8e81:a69e:55a3:7da5:269d, 84.32.84.90
Response IP 91.108.100.232
Found Yes
Hash 8dd95f1310c490d67fdbaff4e534b8743b1925fdd79a433706fdd79ecbd416ee
SimHash 002b19c2e672

Groups

*

Rule Path
Disallow /search?searchtext=*
Disallow /disneyid/*
Disallow /assets/static/ads/*
Disallow /cgi
Disallow /xls
Disallow /imp
Disallow /kmail
Disallow /map
Disallow /log
Disallow /gif
Disallow /panel
Disallow /0/
Disallow /promo/
Disallow /abclinks/
Disallow /houseads/
Allow /xmldata/mrss
Allow /xmldata/rss
Allow /xmldata/xmlPodcast
Allow /xmldata/config
Allow /xmldata/feed
Disallow /sendtofriend/
Allow /meta/sitemap
Disallow /meta/
Disallow /staging/
Disallow /test/
Disallow /swen/
Disallow /intro/
Disallow /go/
Disallow /news/go/
Disallow /widgets/
Disallow /vp2/
Disallow /Video/*playerIndex
Disallow /*carousel/
Disallow /*videoLogin?
Disallow /video/browse/
Disallow /*popup?
Disallow /alerts-v1/
Disallow /not-allowed/
Disallow /beta-story-container/*
Disallow /video/embed/*
Disallow /video/amp/embed/*
Disallow /responder

applebot

Rule Path
Allow /
Disallow /private/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap /xmap
sitemap /xmlLatestStories
sitemap /xmlLatestVideos

Comments

  • robots.txt for /
  • Disallow: /xmldata/