newbernlive.org
robots.txt

Robots Exclusion Standard data for newbernlive.org

Resource Scan

Scan Details

Site Domain newbernlive.org
Base Domain newbernlive.org
Scan Status Ok
Last Scan2024-05-20T05:39:56+00:00
Next Scan 2024-06-19T05:39:56+00:00

Last Scan

Scanned2024-05-20T05:39:56+00:00
URL https://newbernlive.org/robots.txt
Domain IPs 104.21.37.185, 172.67.212.77, 2606:4700:3032::ac43:d44d, 2606:4700:3035::6815:25b9
Response IP 104.21.37.185
Found Yes
Hash 99f537c307b55aa3985cfbde06a729f5470a717e3ad572349434121306c163d4
SimHash 8868d290d531

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

slurp

Rule Path
Disallow

msnbot

Rule Path
Disallow

alexabot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

facebookexternalhit

Rule Path
Allow /imgres