linux-blog.org
robots.txt

Robots Exclusion Standard data for linux-blog.org

Resource Scan

Scan Details

Site Domain linux-blog.org
Base Domain linux-blog.org
Scan Status Ok
Last Scan2025-09-19T04:16:37+00:00
Next Scan 2025-09-26T04:16:37+00:00

Last Scan

Scanned2025-09-19T04:16:37+00:00
URL http://linux-blog.org/robots.txt
Domain IPs 104.237.130.63
Response IP 104.237.130.63
Found Yes
Hash 4b15caf26ac8bbe28f6508459ddf66f75c83c125092b57dd2d80622a0d0550d4
SimHash 2c016510c2f0

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$

Comments

  • Disallow all directories and files within
  • Disallow all files ending with these extensions