theknowledge.com
robots.txt

Robots Exclusion Standard data for theknowledge.com

Resource Scan

Scan Details

Site Domain theknowledge.com
Base Domain theknowledge.com
Scan Status Ok
Last Scan 2025-11-06T20:38:29+00:00
Next Scan 2025-12-06T20:38:29+00:00

Last Scan

Scanned 2025-11-06T20:38:29+00:00
URL https://theknowledge.com/robots.txt
Domain IPs 104.16.243.55
Response IP 104.16.243.55
Found Yes
Hash 3ea7464a6aa4cb3f3ebbd6c18e75a5c43b09ffb53b00e10cb3c9e9446f6e8875
SimHash 6f1ddce0af00

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://theknowledge.com/sitemap.xml
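The groups and records above can be checked programmatically. The sketch below is a minimal illustration, not the site's actual file: ROBOTS_TXT is a reconstruction from the groups listed in this report (the real file's ordering, casing, comments, and whitespace are unknown), and "examplebot" is a hypothetical crawler name used to exercise the * group. It uses Python's standard urllib.robotparser, whose matching rules may differ in detail from those of individual crawlers. site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (an assumption, not the
# verbatim file served at https://theknowledge.com/robots.txt).
ROBOTS_TXT = """\
User-agent: amazonbot
Disallow: /

User-agent: googlebot
Disallow: /nogooglebot/

User-agent: *
Disallow: /login

User-agent: adsbot-google
Disallow: /login

User-agent: nutch
Disallow: /

User-agent: ahrefsbot
Disallow: /login
Crawl-delay: 10

User-agent: ahrefssiteaudit
Disallow: /login
Crawl-delay: 10

User-agent: mj12bot
Disallow: /login
Crawl-delay: 10

Sitemap: https://theknowledge.com/sitemap.xml
"""

BASE = "https://theknowledge.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    # "examplebot" is hypothetical; it matches no named group, so the
    # parser falls back to the * group.
    checks = [
        ("amazonbot", "/"),
        ("googlebot", "/nogooglebot/page"),
        ("googlebot", "/login"),
        ("examplebot", "/login"),
        ("examplebot", "/about"),
    ]
    for agent, path in checks:
        allowed = parser.can_fetch(agent, BASE + path)
        print(f"{agent:12} {path:20} allowed={allowed}")

    # Crawl-delay is reported per group; groups without one yield None.
    for agent in ("ahrefsbot", "ahrefssiteaudit", "mj12bot", "googlebot"):
        print(f"{agent:16} crawl-delay={parser.crawl_delay(agent)}")

    print("sitemaps:", parser.site_maps())
```

Because a crawler that matches a named group uses only that group, googlebot is not bound by the /login rule in the * group under this parser; only /nogooglebot/ is disallowed for it.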