gentside.com
robots.txt

Robots Exclusion Standard data for gentside.com

Resource Scan

Scan Details

Site Domain gentside.com
Base Domain gentside.com
Scan Status Ok
Last Scan2024-11-06T13:19:33+00:00
Next Scan 2024-11-13T13:19:33+00:00

Last Scan

Scanned2024-11-06T13:19:33+00:00
URL https://gentside.com/robots.txt
Redirect https://www.gentside.com/robots.txt
Redirect Domain www.gentside.com
Redirect Base gentside.com
Domain IPs 185.151.190.98
Redirect IPs 185.151.190.98, 2a0a:1580:2000:1a00::25
Response IP 185.151.190.98
Found Yes
Hash 26f05ac61caafaed66ed637955377b875ea515cb98e71dced11a13c1f4627156
SimHash 43004275dd02

Groups

*

Rule Path
Disallow /xhr/*
Disallow /partial/*
Disallow /landing
Disallow /offline
Disallow /passerelle_ta.php
Disallow *_pic*.html

mediapartners-google

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

spiderbot/nutch-1.7

Rule Path
Disallow /

*
googlebot-news
pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.gentside.com/sitemaps/sitemap.xml
sitemap https://www.gentside.com/sitemaps/google_0.xml
sitemap https://www.gentside.com/sitemaps/pinterest_0.xml
sitemap https://www.gentside.com/sitemaps/pinterest_gallery_0.xml
sitemap https://www.gentside.com/sitemaps/google_0.xml
sitemap https://www.gentside.com/sitemaps/pinterest_0.xml
sitemap https://www.gentside.com/sitemaps/pinterest_gallery_0.xml