agriguruji.in
robots.txt

Robots Exclusion Standard data for agriguruji.in

Resource Scan

Scan Details

Site Domain agriguruji.in
Base Domain agriguruji.in
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-03-31T19:25:37+00:00
Next Scan 2025-06-29T19:25:37+00:00

Last Successful Scan

Scanned2024-03-07T19:23:11+00:00
URL https://agriguruji.in/robots.txt
Redirect https://www.agriguruji.in/robots.txt
Redirect Domain www.agriguruji.in
Redirect Base agriguruji.in
Domain IPs 104.21.38.239, 172.67.168.211, 2606:4700:3031::ac43:a8d3, 2606:4700:3033::6815:26ef
Redirect IPs 104.21.38.239, 172.67.168.211, 2606:4700:3031::ac43:a8d3, 2606:4700:3033::6815:26ef
Response IP 172.67.168.211
Found Yes
Hash 2d8246b9b8cd3263fbf095a87e6f557d982f966902110e12f652d17003a94137
SimHash 614993055597

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow
Disallow /cgi-bin/

Other Records

Field Value
sitemap https://www.agriguruji.in/sitemap.xml

Comments

  • robots.txt generated by smallseotools.com