clubic.com
robots.txt

Robots Exclusion Standard data for clubic.com

Resource Scan

Scan Details

Site Domain clubic.com
Base Domain clubic.com
Scan Status Ok
Last Scan2024-11-13T09:28:14+00:00
Next Scan 2024-11-20T09:28:14+00:00

Last Scan

Scanned2024-11-13T09:28:14+00:00
URL https://clubic.com/robots.txt
Redirect https://www.clubic.com/robots.txt
Redirect Domain www.clubic.com
Redirect Base clubic.com
Domain IPs 5.135.119.241, 5.135.119.242, 5.135.119.243
Redirect IPs 104.26.0.113, 104.26.1.113, 172.67.68.138, 2606:4700:20::681a:171, 2606:4700:20::681a:71, 2606:4700:20::ac43:448a
Response IP 172.67.68.138
Found Yes
Hash e23c493905c31dbaf29c304e501112f52373430d0dbf6659b1ca0d3f8cc88369
SimHash 4d28c698f512

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /api/
Disallow /r/
Disallow /rss/
Disallow /editorial/
Disallow /club/
Disallow /stats/
Disallow /hub/
Disallow /commentaires/
Disallow /v1/
Disallow /t/
Disallow /*?*
Disallow /app.php/
Disallow /ops
Disallow /telechargement-en-cours/
Disallow /rechercher/
Disallow /tview$
Disallow /cc$
Disallow /sticky/component
Disallow /sitemaps/news.xml
Allow /assets/*
Allow /js/*

googlebot-news

Rule Path
Disallow /telecharger-fiche

Other Records

Field Value
sitemap https://www.clubic.com/sitemap_news.xml
sitemap https://www.clubic.com/sitemaps/index.xml