annadatha.in
robots.txt

Robots Exclusion Standard data for annadatha.in

Resource Scan

Scan Details

Site Domain annadatha.in
Base Domain annadatha.in
Scan Status Ok
Last Scan2025-12-19T08:47:33+00:00
Next Scan 2026-01-18T08:47:33+00:00

Last Scan

Scanned2025-12-19T08:47:33+00:00
URL https://annadatha.in/robots.txt
Domain IPs 104.21.33.79, 172.67.160.4, 2606:4700:3034::6815:214f, 2606:4700:3037::ac43:a004
Response IP 172.67.160.4
Found Yes
Hash 52973ae800c37bc626d9cb34a4a2962ffade655d963ec767cd41c55efc969cb8
SimHash 25b0b97387b8

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow /wp/
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed
Disallow *openstat%3D
Disallow */rss
Disallow */embed
Disallow /xmlrpc.php
Allow */uploads

googlebot

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow /wp/
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow /xmlrpc.php
Disallow */trackback
Disallow */feed
Disallow */rss
Disallow */embed
Disallow */wlwmanifest.xml
Disallow *utm*%3D
Disallow *openstat%3D
Allow */uploads
Allow /*/*.js
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /wp-*.jpeg
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.gif

Other Records

Field Value
sitemap https://annadatha.in/sitemap_index.xml
sitemap https://annadatha.in/page-sitemap.xml