webhat.in
robots.txt

Robots Exclusion Standard data for webhat.in

Resource Scan

Scan Details

Site Domain webhat.in
Base Domain webhat.in
Scan Status Ok
Last Scan 2025-04-19T20:31:04+00:00
Next Scan 2025-04-26T20:31:04+00:00

Last Scan

Scanned 2025-04-19T20:31:04+00:00
URL https://webhat.in/robots.txt
Domain IPs 104.21.43.38, 172.67.218.222, 2606:4700:3032::ac43:dade, 2606:4700:3034::6815:2b26
Response IP 172.67.218.222
Found Yes
Hash ad6c338901a4f5d00392a20495d5c304c4a0ce6dfcea245b236d5681b8e5cbca
SimHash 7b8a08505d91

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/themes/
Disallow /wp-content/uploads/email-tpl/
Allow /wp-includes/js/
Allow /wp-includes/css/dist/block-library/style.min.css
Allow /wp-admin/admin-ajax.php
Allow /wp-content/themes/blocksy/static/bundle/
Allow /wp-content/themes/blocksy/static/fonts/
Allow /wp-content/themes/blocksy/static/images/
Allow /wp-content/themes/blocksy/static/js/
Allow /wp-content/themes/blocksy-child/style.css
Allow /wp-content/themes/blocksy-child/js/
Allow /wp-content/themes/blocksy-child/css/
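
Several Allow paths in this group sit inside a disallowed directory, so the outcome for a given URL depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, Allow wins ties, and a path with no matching rule is allowed). The scan itself does not state which precedence any particular crawler applies, and the names RULES and is_allowed are illustrative only. Because none of the rules above use wildcards, plain prefix matching is enough here.

```python
# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/wp-content/uploads/email-tpl/"),
    ("allow", "/wp-includes/js/"),
    ("allow", "/wp-includes/css/dist/block-library/style.min.css"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/themes/blocksy/static/bundle/"),
    ("allow", "/wp-content/themes/blocksy/static/fonts/"),
    ("allow", "/wp-content/themes/blocksy/static/images/"),
    ("allow", "/wp-content/themes/blocksy/static/js/"),
    ("allow", "/wp-content/themes/blocksy-child/style.css"),
    ("allow", "/wp-content/themes/blocksy-child/js/"),
    ("allow", "/wp-content/themes/blocksy-child/css/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: RFC 9309 semantics. The longest matching prefix decides,
    an Allow rule wins a tie, and no match at all means allowed.
    """
    best_len = -1
    allowed = True  # default when no rule matches
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in (
        "/wp-admin/admin-ajax.php",
        "/wp-admin/options.php",
        "/wp-includes/js/jquery.js",
        "/wp-content/themes/blocksy/style.css",
        "/blog/post",
    ):
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, /wp-admin/admin-ajax.php and /wp-includes/js/jquery.js come out as allowed, while /wp-admin/options.php and /wp-content/themes/blocksy/style.css are blocked. A parser that applies rules in file order (first match wins) could instead block admin-ajax.php, because the Disallow for /wp-admin/ appears earlier in the file.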

Other Records

Field Value
sitemap https://www.webhat.in/sitemap.xml