gala-news.fr
robots.txt

Robots Exclusion Standard data for gala-news.fr

Resource Scan

Scan Details

Site Domain gala-news.fr
Base Domain gala-news.fr
Scan Status Ok
Last Scan2024-06-12T12:54:21+00:00
Next Scan 2024-06-19T12:54:21+00:00

Last Scan

Scanned2024-06-12T12:54:21+00:00
URL http://gala-news.fr/robots.txt
Redirect https://www.gala.fr/robots.txt
Redirect Domain www.gala.fr
Redirect Base gala.fr
Domain IPs 81.92.80.55, 81.92.80.56
Redirect IPs 104.69.168.142
Response IP 104.69.168.142
Found Yes
Hash f0c5032b5b1c76de1e9d32f4459309b08792649c6f22be05627e007622a5b658
SimHash 2f2880c68e12

Groups

spiderbot

Rule Path
Disallow /

*

Rule Path
Disallow /google_search/
Disallow /extension/gal/
Disallow /cam/gala/
Disallow /verify-Affichage_Charte*
Disallow *?xtor=*
Disallow /index.php/
Disallow /ajax/
Disallow *?*_gl
Disallow *%3F*_gl
Disallow *?*utm_
Disallow *%3F*utm_

Other Records

Field Value
sitemap https://www.gala.fr/sitemap.xml
sitemap https://www.gala.fr/sitemap/news.xml