kleartextbook.com
robots.txt

Robots Exclusion Standard data for kleartextbook.com

Resource Scan

Scan Details

Site Domain kleartextbook.com
Base Domain kleartextbook.com
Scan Status Ok
Last Scan2025-10-26T00:51:08+00:00
Next Scan 2025-11-25T00:51:08+00:00

Last Scan

Scanned2025-10-26T00:51:08+00:00
URL https://kleartextbook.com/robots.txt
Domain IPs 104.21.2.81, 172.67.128.234, 2606:4700:3031::6815:251, 2606:4700:3031::ac43:80ea
Response IP 104.21.2.81
Found Yes
Hash fbe9cf89b8063bb56a62ffe85a889407e0625f9c622a2e882ec83ac27ed6103d
SimHash 4845ca406237

Groups

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google

Rule Path
Disallow

duggmirror

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow /category/*/*
Disallow */trackback/
Disallow */feed/
Disallow */comments/
Disallow /*?
Allow /wp-content/uploads/

Other Records

Field Value
sitemap http://kleartextbook.com/sitemap.xml

Comments

  • Google Image
  • Google AdSense
  • digg mirror
  • global