didyouknow.org
robots.txt

Robots Exclusion Standard data for didyouknow.org

Resource Scan

Scan Details

Site Domain didyouknow.org
Base Domain didyouknow.org
Scan Status Ok
Last Scan2024-11-16T14:49:36+00:00
Next Scan 2024-11-23T14:49:36+00:00

Last Scan

Scanned2024-11-16T14:49:36+00:00
URL https://didyouknow.org/robots.txt
Domain IPs 194.1.147.22, 194.1.147.72
Response IP 194.1.147.72
Found Yes
Hash dd380d1a4ca4dfca5ebe4e74b2bbacbbe1e156cafb75bf4e73b90c91ff3fe07f
SimHash 9d4f780cccd1

Groups

mediapartners-google

Rule Path
Disallow

twitterbot

Rule Path
Disallow

*

Rule Path
Disallow /cgi-bin/

ia_archiver

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

leech

Rule Path
Disallow /

linguee

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

linkscrawler 0.1beta

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

ranksonicbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

superfeedr bot

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

websnake

Rule Path
Disallow /

Other Records

Field Value
sitemap https://didyouknow.org/sitemap.xml