notre-planete.info
robots.txt

Robots Exclusion Standard data for notre-planete.info

Resource Scan

Scan Details

Site Domain notre-planete.info
Base Domain notre-planete.info
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-10-02T02:50:57+00:00
Next Scan 2024-10-09T02:50:57+00:00

Last Successful Scan

Scanned 2024-09-24T01:19:25+00:00
URL https://notre-planete.info/robots.txt
Redirect https://www.notre-planete.info/robots.txt
Redirect Domain www.notre-planete.info
Redirect Base notre-planete.info
Domain IPs 154.56.56.187, 2a02:4780:28:204b::1
Redirect IPs 154.56.56.187, 2a02:4780:28:204b::1
Response IP 154.56.56.187
Found Yes
Hash d4838f0b9cc39c28f8f24c8c624cbdb848f8bfdcab9ec2646fd8d939e3c8747a
SimHash 6054c3975355

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /photos/
Disallow /actualites/images/
Disallow /services/membres/photos/

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

qwantify

Rule Path
Allow /
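
To see how the groups above combine, here is a minimal Python sketch that transcribes them into a dictionary and evaluates a path for a given crawler. It is an illustration, not the scanner's or any crawler's actual logic. It assumes exact, case-insensitive matching of the user-agent token to pick a group (falling back to `*`), and longest-matching-path precedence with Allow winning ties, as described in RFC 9309. The names `GROUPS` and `is_allowed`, the crawler name `SomeOtherBot`, and the file paths in the usage lines are hypothetical. Wildcards (`*`, `$`) are not handled, since none appear in the rules listed.

```python
# Groups transcribed from the scan above: user-agent token -> list of (rule, path).
GROUPS = {
    "*": [("disallow", "/")],
    "googlebot": [("allow", "/")],
    "mediapartners-google": [("allow", "/")],
    "googlebot-mobile": [("allow", "/")],
    "googlebot-image": [
        ("allow", "/photos/"),
        ("disallow", "/actualites/images/"),
        ("disallow", "/services/membres/photos/"),
    ],
    "slurp": [("allow", "/")],
    "msnbot": [("allow", "/")],
    "bingbot": [("allow", "/")],
    "qwantify": [("allow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be crawled by `user_agent`.

    Assumption: the group is chosen by exact, case-insensitive token match,
    and the longest matching path prefix decides (Allow wins a tie).
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for rule, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and rule == "allow"):
                best_len, allowed = n, rule == "allow"
    return allowed


# Hypothetical paths, used only to exercise each kind of rule.
print(is_allowed("SomeOtherBot", "/"))                               # falls back to *
print(is_allowed("Googlebot", "/actualites/"))                       # Allow /
print(is_allowed("Googlebot-Image", "/photos/a.jpg"))                # Allow /photos/
print(is_allowed("Googlebot-Image", "/actualites/images/a.jpg"))     # Disallow
```

Under these assumptions, any crawler without its own group is blocked site-wide, the listed search crawlers are allowed everywhere, and googlebot-image is kept out of only the two disallowed image directories.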