sport.gentside.com
robots.txt

Robots Exclusion Standard data for sport.gentside.com

Resource Scan

Scan Details

Site Domain sport.gentside.com
Base Domain gentside.com
Scan Status Ok
Last Scan 2024-11-06T02:07:17+00:00
Next Scan 2024-11-20T02:07:17+00:00

Last Scan

Scanned 2024-11-06T02:07:17+00:00
URL https://sport.gentside.com/robots.txt
Redirect https://www.uppercutmma.com/robots.txt
Redirect Domain www.uppercutmma.com
Redirect Base uppercutmma.com
Domain IPs 185.151.190.98, 2a0a:1580:2000:1a00::25
Redirect IPs 185.151.190.98, 2a0a:1580:2000:1a00::25
Response IP 185.151.190.98
Found Yes
Hash ac0e2389c5e082ccbd71c60afb084a6138529a80b71cff1d71982d35e1ac8e76
SimHash 43084774d582

Groups

*

Rule Path
Disallow /xhr/*
Disallow /partial/*
Disallow /landing
Disallow /offline
Disallow /passerelle_ta.php
Disallow *_pic*.html
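
The Disallow patterns listed for the * group can be checked against a URL path with a small matcher. The sketch below is illustrative only: it assumes the common wildcard semantics (a pattern is a prefix match, `*` matches any run of characters, a trailing `$` anchors the end of the path), and the names `DISALLOW_ALL_AGENTS`, `pattern_to_regex`, and `is_disallowed` are hypothetical, not part of the scan. Because this group lists no Allow rules, the sketch does not handle Allow/Disallow precedence.

```python
import re

# Disallow patterns copied from the "*" group in the scan above.
DISALLOW_ALL_AGENTS = [
    "/xhr/*",
    "/partial/*",
    "/landing",
    "/offline",
    "/passerelle_ta.php",
    "*_pic*.html",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    Assumed semantics: prefix match, '*' matches any run of
    characters, and a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_ALL_AGENTS) -> bool:
    """Return True if any Disallow pattern matches the given path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # The example paths are made up for illustration.
    for path in ["/xhr/comments", "/landing", "/news/article.html",
                 "/gallery/fight_pic3.html"]:
        print(path, "disallowed" if is_disallowed(path) else "allowed")
```

Under these assumptions, a path such as `/landing-page` is also disallowed, since `/landing` is treated as a prefix rather than an exact match.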

mediapartners-google

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

spiderbot/nutch-1.7

Rule Path
Disallow /

*
googlebot-news
pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.uppercutmma.com/sitemaps/sitemap.xml
sitemap https://www.uppercutmma.com/sitemaps/google_0.xml
sitemap https://www.uppercutmma.com/sitemaps/pinterest_0.xml
sitemap https://www.uppercutmma.com/sitemaps/pinterest_gallery_0.xml
sitemap https://www.uppercutmma.com/sitemaps/google_0.xml
sitemap https://www.uppercutmma.com/sitemaps/pinterest_0.xml
sitemap https://www.uppercutmma.com/sitemaps/pinterest_gallery_0.xml