gloireadieu.com
robots.txt

Robots Exclusion Standard data for gloireadieu.com

Resource Scan

Scan Details

Site Domain gloireadieu.com
Base Domain gloireadieu.com
Scan Status Ok
Last Scan2024-06-11T18:04:59+00:00
Next Scan 2024-06-18T18:04:59+00:00

Last Scan

Scanned2024-06-11T18:04:59+00:00
URL https://gloireadieu.com/robots.txt
Redirect https://www.gloireadieu.com/robots.txt
Redirect Domain www.gloireadieu.com
Redirect Base gloireadieu.com
Domain IPs 104.21.87.64, 172.67.142.1, 2606:4700:3030::ac43:8e01, 2606:4700:3033::6815:5740
Redirect IPs 104.21.87.64, 172.67.142.1, 2606:4700:3030::ac43:8e01, 2606:4700:3033::6815:5740
Response IP 104.21.87.64
Found Yes
Hash d706112ebb4d5d03408b32b1581f271d6f8b254612846d862da4da83d041a818
SimHash 2b435c94527b

Groups

bingbot

Rule Path
Disallow /

applebot

Rule Path
Allow /

baiduspider

Rule Path
Allow /

baiduspider-image

Rule Path
Allow /

applebot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googleother

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

yandexbot

Rule Path
Allow /

yandexbot-mobile

Rule Path
Allow /

yahoo slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

pinterestbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /
Allow /wp-content/uploads/
Disallow /readme.html
Disallow /*.mp4$
Disallow /cgi-bin
Disallow /wp-login.php
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow */trackback
Disallow */comments/
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$

Other Records

Field Value
sitemap https://www.gloireadieu.com/sitemap.xml