
Robots Exclusion Standard data for waghalsaada.com

Resource Scan

Scan Details

Site Domain waghalsaada.com
Base Domain waghalsaada.com
Scan Status Ok
Last Scan 2025-10-12T18:50:51+00:00
Next Scan 2025-11-11T18:50:51+00:00

Last Scan

Scanned 2025-10-12T18:50:51+00:00
URL https://waghalsaada.com/robots.txt
Domain IPs 104.21.35.237, 172.67.180.176, 2606:4700:3030::ac43:b4b0, 2606:4700:3035::6815:23ed
Response IP 104.21.35.237
Found Yes
Hash 3d10e92c8cdf2fccaf044c20d9cca98f97a87aa559f6bae49e1f28b499a1ab45
SimHash 7e34d920e2d3
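
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, though the report does not name the algorithm or say exactly which bytes were hashed. A minimal sketch for comparing a freshly fetched robots.txt against the recorded value, assuming SHA-256 over the raw response body (the function names are hypothetical):

```python
import hashlib

# Hash value copied from the scan record above.
REPORTED_HASH = "3d10e92c8cdf2fccaf044c20d9cca98f97a87aa559f6bae49e1f28b499a1ab45"

def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the scan."""
    return body_digest(body) == REPORTED_HASH
```

A mismatch may mean the file changed since the scan, or simply that the scanner hashes something other than the raw body.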

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-json/
Disallow /cgi-bin/
Disallow /search
Disallow /?s=
Disallow /author/
Disallow /tag/
Disallow /feed/
Disallow /trackback/
Disallow /*?orderby=
Disallow /*?filter=
Disallow /*?add-to-cart=
Disallow /*?replytocom=
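
The groups above can be read as follows: the six named AI crawlers are blocked from the whole site, and every other agent falls back to the `*` group. A rough sketch of that evaluation, assuming `*` in a path matches any run of characters and a trailing `$` anchors the end (the usual wildcard convention). The `GROUPS` layout, the exact-name agent lookup, and the function names are illustrative assumptions, not the scanner's own logic:

```python
import re

# Rules transcribed from the groups listed above.
AI_AGENTS = ["gptbot", "chatgpt-user", "google-extended",
             "ccbot", "anthropic-ai", "claudebot"]
GROUPS = {agent: ["/"] for agent in AI_AGENTS}
GROUPS["*"] = [
    "/wp-admin/", "/wp-json/", "/cgi-bin/", "/search", "/?s=",
    "/author/", "/tag/", "/feed/", "/trackback/",
    "/*?orderby=", "/*?filter=", "/*?add-to-cart=", "/*?replytocom=",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a path rule into a regex anchored at the start of the path."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and join them with '.*' where the rule had '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(agent: str, path: str) -> bool:
    """Check a path (including any query string) against the agent's group."""
    # Simplification: exact, case-insensitive name lookup, else the '*' group.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Because this file has no Allow rules, precedence between Allow and Disallow never comes into play here. Note that under this reading a rule such as `/*?orderby=` only matches when `orderby` is the first query parameter; `/shop/?page=2&orderby=price` would not be caught.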

Other Records

Field Value
sitemap https://waghalsaada.com/sitemap_index.xml
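
The sitemap record can also be read programmatically. A small sketch using Python's standard `urllib.robotparser` (the `site_maps()` method requires Python 3.8 or later); the lines passed to the parser are a shortened reconstruction from this report, not the fetched file:

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the scanned file, for illustration only.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /wp-admin/",
    "",
    "Sitemap: https://waghalsaada.com/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns a list of Sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```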