bnd.newspapers.com
robots.txt

Robots Exclusion Standard data for bnd.newspapers.com

Resource Scan

Scan Details

Site Domain bnd.newspapers.com
Base Domain newspapers.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-04-17T06:51:33+00:00
Next Scan 2024-07-16T06:51:33+00:00

Last Successful Scan

Scanned2023-05-31T06:34:28+00:00
URL https://bnd.newspapers.com/robots.txt
Domain IPs 104.16.207.8, 104.16.208.8, 2606:4700::6810:cf08, 2606:4700::6810:d008
Response IP 104.16.208.8
Found Yes
Hash caacf2ff90b06ac6dd79a281580ff795b60f8407e524f427821b96717c372d2c
SimHash 500eaf41d17f

Groups

*

Rule Path
Disallow /busy.html
Disallow /error.html
Disallow /error.php
Disallow /download/
Disallow /clippings/download/
Allow /newspage/

ahrefsbot

Rule Path
Disallow /busy.html
Disallow /error.html
Disallow /error.php

googlebot-image

Rule Path
Allow /*

applebot

Rule Path
Allow /*

facebot

Rule Path
Allow /*

Comments

  • Slow Bots see https://ahrefs.com/robot for more info
  • Updated 2/25/2019