ouqprint.com
robots.txt

Robots Exclusion Standard data for ouqprint.com

Resource Scan

Scan Details

Site Domain ouqprint.com
Base Domain ouqprint.com
Scan Status Ok
Last Scan2024-05-31T12:00:05+00:00
Next Scan 2024-06-07T12:00:05+00:00

Last Scan

Scanned2024-05-31T12:00:05+00:00
URL https://ouqprint.com/robots.txt
Domain IPs 2a02:4780:84:7c91:a797:616f:5231:2d91, 77.37.75.166
Response IP 93.127.201.234
Found Yes
Hash 8dd95f1310c490d67fdbaff4e534b8743b1925fdd79a433706fdd79ecbd416ee
SimHash 002b19c2e672

Groups

*

Rule Path
Disallow /search?searchtext=*
Disallow /disneyid/*
Disallow /assets/static/ads/*
Disallow /cgi
Disallow /xls
Disallow /imp
Disallow /kmail
Disallow /map
Disallow /log
Disallow /gif
Disallow /panel
Disallow /0/
Disallow /promo/
Disallow /abclinks/
Disallow /houseads/
Allow /xmldata/mrss
Allow /xmldata/rss
Allow /xmldata/xmlPodcast
Allow /xmldata/config
Allow /xmldata/feed
Disallow /sendtofriend/
Allow /meta/sitemap
Disallow /meta/
Disallow /staging/
Disallow /test/
Disallow /swen/
Disallow /intro/
Disallow /go/
Disallow /news/go/
Disallow /widgets/
Disallow /vp2/
Disallow /Video/*playerIndex
Disallow /*carousel/
Disallow /*videoLogin?
Disallow /video/browse/
Disallow /*popup?
Disallow /alerts-v1/
Disallow /not-allowed/
Disallow /beta-story-container/*
Disallow /video/embed/*
Disallow /video/amp/embed/*
Disallow /responder

applebot

Rule Path
Allow /
Disallow /private/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap /xmap
sitemap /xmlLatestStories
sitemap /xmlLatestVideos

Comments

  • robots.txt for /
  • Disallow: /xmldata/