voyage.gentside.com
robots.txt

Robots Exclusion Standard data for voyage.gentside.com

Resource Scan

Scan Details

Site Domain voyage.gentside.com
Base Domain gentside.com
Scan Status Ok
Last Scan 2024-11-11T02:43:03+00:00
Next Scan 2024-11-25T02:43:03+00:00

Last Scan

Scanned 2024-11-11T02:43:03+00:00
URL https://voyage.gentside.com/robots.txt
Domain IPs 185.151.190.98, 2a0a:1580:2000:1a00::25
Response IP 185.151.190.98
Found Yes
Hash 4a986f8f029e7c9ddb0568126bba86c19a8c8e4c9bc3a883789f3b562a1df9c2
SimHash 4104c954d50a

Groups

*

Rule Path
Disallow /xhr/*
Disallow /partial/*
Disallow /landing
Disallow /offline
Disallow /passerelle_ta.php
Disallow *_pic*.html
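
The wildcard rules in this group can be tested against sample paths with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical. Allow rules, longest-match precedence, and percent-encoding normalization are deliberately left out, since this group contains only Disallow rules.

```python
import re

# Disallow paths for the "*" group, copied from the table above.
DISALLOW = [
    "/xhr/*",
    "/partial/*",
    "/landing",
    "/offline",
    "/passerelle_ta.php",
    "*_pic*.html",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: `*` matches any run of characters and a trailing `$`
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW):
    """Return True if any pattern matches the path from its start."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Hypothetical sample paths, for illustration only.
    for sample in ["/xhr/feed", "/landing", "/photos/paris_pic3.html", "/voyage/paris"]:
        print(sample, is_disallowed(sample))
```

Because matching is prefix-based, a rule such as `/landing` would also cover a path like `/landing-page` under this interpretation.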

mediapartners-google

Rule Path
Disallow (empty path; nothing is disallowed)

ia_archiver

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

spiderbot/nutch-1.7

Rule Path
Disallow /

*
googlebot-news
pinterestbot

No rules defined. All paths allowed.
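
The groups listed above can be summarized as a lookup from user agent to Disallow paths. This is a rough sketch under stated assumptions: agent names are compared case-insensitively as whole strings, an agent with no group of its own falls back to `*`, and the two `*` groups are treated as one merged group. The names `GROUPS` and `rules_for` are hypothetical, and real crawlers may match product tokens differently.

```python
# Disallow paths per user-agent group, copied from the listings above.
GROUPS = {
    "*": [
        "/xhr/*",
        "/partial/*",
        "/landing",
        "/offline",
        "/passerelle_ta.php",
        "*_pic*.html",
    ],
    "mediapartners-google": [""],  # empty Disallow: nothing is blocked
    "ia_archiver": ["/"],
    "spiderbot": ["/"],
    "spiderbot/nutch-1.7": ["/"],
    "googlebot-news": [],  # no rules defined
    "pinterestbot": [],  # no rules defined
}


def rules_for(user_agent):
    """Return the effective Disallow paths for a user agent.

    Assumption: exact, case-insensitive match on the agent name,
    falling back to the "*" group when no specific group exists.
    """
    token = user_agent.strip().lower()
    paths = GROUPS.get(token, GROUPS["*"])
    # An empty Disallow value restricts nothing, so drop it.
    return [p for p in paths if p]


if __name__ == "__main__":
    for agent in ["ia_archiver", "Mediapartners-Google", "googlebot-news", "SomeOtherBot"]:
        print(agent, rules_for(agent))
```

Under these assumptions, ia_archiver and spiderbot are blocked from the whole site, while mediapartners-google, googlebot-news, and pinterestbot end up with no restrictions.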

Other Records

Field Value
sitemap https://voyage.gentside.com/sitemaps/sitemap.xml
sitemap https://voyage.gentside.com/sitemaps/google_0.xml
sitemap https://voyage.gentside.com/sitemaps/pinterest_0.xml
sitemap https://voyage.gentside.com/sitemaps/pinterest_gallery_0.xml
sitemap https://voyage.gentside.com/sandbox
sitemap https://voyage.gentside.com/sitemaps/google_0.xml
sitemap https://voyage.gentside.com/sitemaps/pinterest_0.xml
sitemap https://voyage.gentside.com/sitemaps/pinterest_gallery_0.xml
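
The sitemap records above include three URLs that appear twice. A consumer of this list would probably want each URL once, in the order first seen. The following is a minimal sketch of that step; the variable names are hypothetical, and whether the /sandbox entry is a valid sitemap is not checked here.

```python
# Sitemap records in the order they appear in the listing above.
SITEMAPS = [
    "https://voyage.gentside.com/sitemaps/sitemap.xml",
    "https://voyage.gentside.com/sitemaps/google_0.xml",
    "https://voyage.gentside.com/sitemaps/pinterest_0.xml",
    "https://voyage.gentside.com/sitemaps/pinterest_gallery_0.xml",
    "https://voyage.gentside.com/sandbox",
    "https://voyage.gentside.com/sitemaps/google_0.xml",
    "https://voyage.gentside.com/sitemaps/pinterest_0.xml",
    "https://voyage.gentside.com/sitemaps/pinterest_gallery_0.xml",
]

# dict.fromkeys keeps the first occurrence of each key and preserves
# insertion order, so this removes repeats without reordering.
unique_sitemaps = list(dict.fromkeys(SITEMAPS))

if __name__ == "__main__":
    for url in unique_sitemaps:
        print(url)
```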