scoop.co.nz
robots.txt

Robots Exclusion Standard data for scoop.co.nz

Resource Scan

Scan Details

Site Domain scoop.co.nz
Base Domain scoop.co.nz
Scan Status Ok
Last Scan 2024-05-15T04:47:57+00:00
Next Scan 2024-05-22T04:47:57+00:00

Last Scan

Scanned 2024-05-15T04:47:57+00:00
URL https://scoop.co.nz/robots.txt
Redirect https://www.scoop.co.nz:443/robots.txt
Redirect Domain www.scoop.co.nz
Redirect Base scoop.co.nz
Domain IPs 13.35.18.112, 13.35.18.114, 13.35.18.87, 13.35.18.99
Redirect IPs 108.157.254.122, 108.157.254.75, 108.157.254.83, 108.157.254.95
Response IP 108.157.254.83
Found Yes
Hash d7a4de0c463324885661f1dbbfb10b5e3879467dccdf64b11efb0a8a82c9c1bd
SimHash 013ddb698103

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

yahoo

Rule Path
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

Other Records

Field Value
crawl-delay 60

slurp

Rule Path
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

Other Records

Field Value
crawl-delay 5

msnbot

Rule Path
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

Other Records

Field Value
crawl-delay 10

jeeves

Rule Path
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

Other Records

Field Value
crawl-delay 10
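
The crawl-delay records in the four groups above can be read back with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method: the ROBOTS_TXT text is an assumed reconstruction (this report does not show the original file layout), each group is shortened to one Disallow rule, and "examplebot" is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of four groups from the scan; the layout of the
# real file is not shown in this report, and the rule lists are shortened.
ROBOTS_TXT = """\
User-agent: yahoo
Disallow: /cgi-bin/
Crawl-delay: 60

User-agent: slurp
Disallow: /cgi-bin/
Crawl-delay: 5

User-agent: msnbot
Disallow: /cgi-bin/
Crawl-delay: 10

User-agent: jeeves
Disallow: /cgi-bin/
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Look up the delay (in seconds) that each named group requests.
delays = {
    agent: parser.crawl_delay(agent)
    for agent in ("yahoo", "slurp", "msnbot", "jeeves")
}
print(delays)

# An agent with no matching group gets None here, because this shortened
# reconstruction has no "*" group carrying a crawl-delay.
print(parser.crawl_delay("examplebot"))
```

Note that crawl-delay is a non-standard extension, so whether a given crawler honours it is up to that crawler.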

msiecrawler

Rule Path
Disallow /stories/
Disallow /archive/scoop/stories
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

ms search 4.0 robot

Rule Path
Disallow /stories/
Disallow /archive/scoop/stories
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl

site server 3.0 robot

Rule Path
Disallow /stories/
Disallow /archive/scoop/stories/
Disallow /cgi-bin/
Disallow /myscoop/
Disallow /stories/print.html
Disallow /stories/email/
Disallow /share/
Disallow /xl
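
The difference between the wildcard group and the Microsoft crawler groups above is the extra Disallow on /stories/. A rough sketch of how a prefix-matching parser treats the two, again using Python's urllib.robotparser: the ROBOTS_TXT text is an assumed reconstruction of two groups, "examplebot" is a hypothetical agent, and /xlarge is a made-up path used only to show prefix matching.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the "*" and "msiecrawler" groups from the scan.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /myscoop/
Disallow: /stories/print.html
Disallow: /stories/email/
Disallow: /share/
Disallow: /xl

User-agent: msiecrawler
Disallow: /stories/
Disallow: /archive/scoop/stories
Disallow: /cgi-bin/
Disallow: /myscoop/
Disallow: /stories/print.html
Disallow: /stories/email/
Disallow: /share/
Disallow: /xl
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.scoop.co.nz"

# A generic agent falls back to the "*" group: /stories/ itself is allowed,
# but the print view under it is not.
generic_stories = parser.can_fetch("examplebot", BASE + "/stories/")
generic_print = parser.can_fetch("examplebot", BASE + "/stories/print.html")

# msiecrawler has its own group, which disallows everything under /stories/.
msie_stories = parser.can_fetch("msiecrawler", BASE + "/stories/")

# Rules are path prefixes, so "Disallow: /xl" also covers longer paths that
# merely start with /xl (hypothetical path).
generic_xl = parser.can_fetch("examplebot", BASE + "/xlarge")

print(generic_stories, generic_print, msie_stories, generic_xl)
```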

morning paper

Rule Path
Disallow /

npbot

Rule Path
Disallow /

art-online.com

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

aipbot

Rule Path
Disallow /

maxamine.com--robot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.scoop.co.nz/sitemap.xml
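
The fully blocked agents and the sitemap record above can be checked the same way. This is a sketch under stated assumptions: ROBOTS_TXT is a shortened, assumed reconstruction (two of the blocked agents plus a one-rule wildcard group), "examplebot" is hypothetical, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed, shortened reconstruction; the real file lists more groups.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/

User-agent: baiduspider
Disallow: /

User-agent: turnitinbot
Disallow: /

Sitemap: http://www.scoop.co.nz/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

HOME = "https://www.scoop.co.nz/"

# "Disallow: /" matches every path, so these agents are shut out entirely.
blocked = [
    agent for agent in ("baiduspider", "turnitinbot")
    if not parser.can_fetch(agent, HOME)
]

# Any other agent falls through to the "*" group and may fetch the home page.
generic_home = parser.can_fetch("examplebot", HOME)

# Sitemap records are independent of the user-agent groups.
sitemaps = parser.site_maps()

print(blocked, generic_home, sitemaps)
```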

Comments

  • For more info consult:
  • http://info.webcrawler.com/mak/projects/robots/norobots.html