htmlka.com
robots.txt

Robots Exclusion Standard data for htmlka.com

Resource Scan

Scan Details

Site Domain htmlka.com
Base Domain htmlka.com
Scan Status Ok
Last Scan 2025-10-31T01:04:35+00:00
Next Scan 2025-11-30T01:04:35+00:00
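
The last and next scan timestamps are exactly 30 days apart. The report does not state a rescan policy, so the fixed 30-day interval below is an inference from these two values. The name `RESCAN_INTERVAL` is illustrative only.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details listing.
last_scan = datetime.fromisoformat("2025-10-31T01:04:35+00:00")
next_scan = datetime.fromisoformat("2025-11-30T01:04:35+00:00")

# ASSUMPTION: a fixed 30-day rescan interval, inferred from the two values.
RESCAN_INTERVAL = timedelta(days=30)

interval = next_scan - last_scan
print(interval.days)
```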

Last Scan

Scanned 2025-10-31T01:04:35+00:00
URL https://htmlka.com/robots.txt
Domain IPs 104.21.31.177, 172.67.178.239, 2606:4700:3033::6815:1fb1, 2606:4700:3035::ac43:b2ef
Response IP 104.21.31.177
Found Yes
Hash f66f25a6ecb10c1336b51574695d4e07bcef751a688722baf1286dc16b3e21fb
SimHash 7d35bb50d7f0
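
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response bytes and shows how such a fingerprint could flag a changed robots.txt between scans. The function name `fingerprint` and both sample bodies are hypothetical and are not the actual htmlka.com file.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (ASSUMPTION: SHA-256)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical file contents, not the real robots.txt of any site.
old_body = b"User-agent: *\nDisallow: /search\n"
new_body = b"User-agent: *\nDisallow: /search\nDisallow: /author/\n"

# Any byte-level difference produces a different digest.
changed = fingerprint(old_body) != fingerprint(new_body)
print(len(fingerprint(old_body)), changed)
```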

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /wp/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow */feed/
Allow */wp-*/*ajax*.php
Allow */wp-sitemap
Allow */uploads
Allow */wp-*/*.js
Allow */wp-*/*.css
Allow */wp-*/*.png
Allow */wp-*/*.jpg
Allow */wp-*/*.jpeg
Allow */wp-*/*.gif
Allow */wp-*/*.svg
Allow */wp-*/*.webp
Allow */wp-*/*.swf
Allow */wp-*/*.pdf

googlebot

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /wp/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow */feed/
Allow */wp-*/*ajax*.php
Allow */wp-sitemap
Allow */uploads
Allow */wp-*/*.js
Allow */wp-*/*.css
Allow */wp-*/*.png
Allow */wp-*/*.jpg
Allow */wp-*/*.jpeg
Allow */wp-*/*.gif
Allow */wp-*/*.svg
Allow */wp-*/*.webp
Allow */wp-*/*.swf
Allow */wp-*/*.pdf
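
Both groups list the same rules, so the generic `*` group and the `googlebot` group behave identically here. The sketch below shows one way to evaluate such rules, assuming the longest-match precedence described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules is included for brevity. Real crawlers may differ in details such as percent-encoding normalization.

```python
import re

# A subset of the rules listed above, as (rule, path pattern) pairs.
RULES = [
    ("Disallow", "/wp-"),
    ("Disallow", "/?"),
    ("Disallow", "*?s="),
    ("Disallow", "/search"),
    ("Disallow", "*/embed$"),
    ("Disallow", "*/feed/"),
    ("Allow", "*/wp-*/*ajax*.php"),
    ("Allow", "*/uploads"),
    ("Allow", "*/wp-*/*.jpg"),
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' for each '*' wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as rule matching requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow

for path in ["/wp-admin/", "/wp-admin/admin-ajax.php", "/about/"]:
    print(path, is_allowed(RULES, path))
```

Under these assumptions, `/wp-admin/` is blocked by `/wp-`, while `/wp-admin/admin-ajax.php` is permitted because the longer Allow pattern `*/wp-*/*ajax*.php` takes precedence.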

Other Records

Field Value
sitemap https://htmlka.com/sitemap_index.xml
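
Sitemap records sit outside the user-agent groups and apply to every crawler. A minimal sketch of pulling them out of a robots.txt body follows. The function name `sitemap_urls` is hypothetical, and the sample text is a reconstruction for illustration that reuses only the sitemap URL reported above.

```python
def sitemap_urls(robots_txt: str) -> list[str]:
    """Collect the values of all 'Sitemap:' lines (field name is case-insensitive)."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own colons survive.
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls

# Hypothetical robots.txt text, not the verbatim htmlka.com file.
sample = """User-agent: *
Disallow: /search

Sitemap: https://htmlka.com/sitemap_index.xml
"""

print(sitemap_urls(sample))
```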