gala.fr
robots.txt

Robots Exclusion Standard data for gala.fr

Resource Scan

Scan Details

Site Domain gala.fr
Base Domain gala.fr
Scan Status Ok
Last Scan 2024-05-09T12:21:11+00:00
Next Scan 2024-05-16T12:21:11+00:00

Last Scan

Scanned 2024-05-09T12:21:11+00:00
URL https://gala.fr/robots.txt
Redirect https://www.gala.fr/robots.txt
Redirect Domain www.gala.fr
Redirect Base gala.fr
Domain IPs 52.31.223.244
Redirect IPs 23.50.86.135
Response IP 104.69.47.225
Found Yes
Hash f0c5032b5b1c76de1e9d32f4459309b08792649c6f22be05627e007622a5b658
SimHash 2f2880c68e12

Groups

spiderbot

Rule Path
Disallow /

*

Rule Path
Disallow /google_search/
Disallow /extension/gal/
Disallow /cam/gala/
Disallow /verify-Affichage_Charte*
Disallow *?xtor=*
Disallow /index.php/
Disallow /ajax/
Disallow *?*_gl
Disallow *%3F*_gl
Disallow *?*utm_
Disallow *%3F*utm_
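
Several of the rules in the `*` group rely on wildcards, so a plain prefix comparison would not evaluate them correctly. The following is a minimal sketch of how such patterns could be matched, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and this is not necessarily how any particular crawler implements matching.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/google_search/",
    "/extension/gal/",
    "/cam/gala/",
    "/verify-Affichage_Charte*",
    "*?xtor=*",
    "/index.php/",
    "/ajax/",
    "*?*_gl",
    "*%3F*_gl",
    "*?*utm_",
    "*%3F*utm_",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumption: `*` matches any run of characters, a trailing `$` pins the
    end of the URL, and every other character is taken literally.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" where "*" stood.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path.

    The group has no Allow rules, so no precedence logic is needed here.
    """
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW)
```

Under these assumptions, a path such as `/news?utm_source=x` is blocked by `*?*utm_`, while `/news?page=2` matches none of the patterns. The `%3F` variants cover URLs in which the `?` has been percent-encoded.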

Other Records

Field Value
sitemap https://www.gala.fr/sitemap.xml
sitemap https://www.gala.fr/sitemap/news.xml
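
As a rough illustration of how the groups and sitemap records above fit together, the sketch below rebuilds an approximate robots.txt from the scan data and feeds it to Python's standard `urllib.robotparser`. The reconstructed text is an assumption (the original file's exact layout, ordering, and comments are not shown in this report), `examplebot` is a hypothetical user agent, and the wildcard rules are left out because the standard-library parser compares paths by prefix only.

```python
from urllib.robotparser import RobotFileParser

# Approximate reconstruction from the scan data above; wildcard rules omitted.
ROBOTS_TXT = """\
User-agent: spiderbot
Disallow: /

User-agent: *
Disallow: /google_search/
Disallow: /extension/gal/
Disallow: /cam/gala/
Disallow: /index.php/
Disallow: /ajax/

Sitemap: https://www.gala.fr/sitemap.xml
Sitemap: https://www.gala.fr/sitemap/news.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Sitemap records are global: they do not belong to any user-agent group.
sitemaps = parser.site_maps()  # requires Python 3.8+

# "spiderbot" has its own group and is blocked from the whole site.
spiderbot_home = parser.can_fetch("spiderbot", "https://www.gala.fr/")

# Any other agent (here the hypothetical "examplebot") falls back to "*".
other_ajax = parser.can_fetch("examplebot", "https://www.gala.fr/ajax/load")
other_article = parser.can_fetch("examplebot", "https://www.gala.fr/people/article")
```

With this input, `spiderbot` is refused everywhere, while other agents are refused only under the listed prefixes and both sitemap URLs are returned by `site_maps()`.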