india-herald.com
robots.txt

Robots Exclusion Standard data for india-herald.com

Resource Scan

Scan Details

Site Domain india-herald.com
Base Domain india-herald.com
Scan Status Ok
Last Scan 2025-02-14T11:27:01+00:00
Next Scan 2025-03-16T11:27:01+00:00

Last Scan

Scanned 2025-02-14T11:27:01+00:00
URL https://india-herald.com/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.48.1
Found Yes
Hash d1ee96030c9743ed1df7442e615086d49f15b312d31ad8b31dbd66bdfd7f5572
SimHash 0844d090d551
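
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is only a sketch under the assumption that it is a SHA-256 digest of the raw response body; the function name `sha256_hex` is hypothetical.

```python
import hashlib

# Hash value as shown in the scan record above.
REPORTED_HASH = "d1ee96030c9743ed1df7442e615086d49f15b312d31ad8b31dbd66bdfd7f5572"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw bytes of robots.txt with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes) -> bool:
    """Compare a freshly fetched body against the hash in the scan record."""
    return sha256_hex(body) == REPORTED_HASH
```

If the assumption holds, passing the bytes fetched from the URL above to `matches_reported_hash` would indicate whether the file has changed since the scan.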

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

slurp

Rule Path
Disallow

msnbot

Rule Path
Disallow

alexabot

Rule Path
Disallow

twitterbot

Rule Path
Disallow
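
Under the Robots Exclusion Standard, a `Disallow` rule with an empty path permits everything, while `Disallow /` blocks the whole site. Read that way, the groups above allow the six named crawlers and block every other agent. The sketch below rebuilds a robots.txt from the listed groups and checks it with Python's standard `urllib.robotparser`. The rebuilt file is an approximation: the original may differ in ordering, comments, or whitespace. `ExampleBot` is a made-up agent name used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Groups from the scan, as (user-agent, disallow path) pairs.
# An empty path means "nothing is disallowed" for that agent.
GROUPS = [
    ("*", "/"),
    ("googlebot", ""),
    ("mediapartners-google", ""),
    ("slurp", ""),
    ("msnbot", ""),
    ("alexabot", ""),
    ("twitterbot", ""),
]


def build_robots_txt(groups):
    """Rebuild an approximate robots.txt from the scanned groups."""
    blocks = [
        f"User-agent: {agent}\nDisallow: {path}".rstrip()
        for agent, path in groups
    ]
    return "\n\n".join(blocks) + "\n"


def is_allowed(agent, url="https://india-herald.com/"):
    """Return True if the rebuilt rules let `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(GROUPS).splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("googlebot", "twitterbot", "ExampleBot"):
        print(agent, is_allowed(agent))
```

Agents with their own group match that group first; any agent without one falls back to the `*` group and is refused.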