didyouknowdaily.com
robots.txt

Robots Exclusion Standard data for didyouknowdaily.com

Resource Scan

Scan Details

Site Domain didyouknowdaily.com
Base Domain didyouknowdaily.com
Scan Status Ok
Last Scan2024-10-04T04:04:22+00:00
Next Scan 2024-10-11T04:04:22+00:00

Last Scan

Scanned2024-10-04T04:04:22+00:00
URL https://didyouknowdaily.com/robots.txt
Redirect https://www.didyouknowdaily.com/robots.txt
Redirect Domain www.didyouknowdaily.com
Redirect Base didyouknowdaily.com
Domain IPs 216.24.57.1
Redirect IPs 216.24.57.252, 216.24.57.4
Response IP 216.24.57.4
Found Yes
Hash ba0bfbbaf390d62887a706a4b262144dbc0d7b084ef112299e3a898445633834
SimHash 1ad0d305d774

Groups

twitterbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

blexbot/1.0

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

linkdexbot/2.1

Rule Path
Disallow /

linkdexbot/2.2

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path
Allow /$
Disallow /

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • Allow Twitterbot in order to read Twitter Cards
  • Allow Google Mediabot for AdSense/AdX
  • Bots
  • Other