le-gr20.fr
robots.txt
Robots Exclusion Standard data for le-gr20.fr
Resource Scan
Scan Details
Site Domain | le-gr20.fr |
Base Domain | le-gr20.fr |
Scan Status | Ok |
Last Scan | 2024-06-19T01:55:31+00:00 |
Next Scan | 2024-07-19T01:55:31+00:00 |
Last Scan
Scanned | 2024-06-19T01:55:31+00:00 |
URL | https://le-gr20.fr/robots.txt |
Redirect | https://www.le-gr20.fr/robots.txt |
Redirect Domain | www.le-gr20.fr |
Redirect Base | le-gr20.fr |
Domain IPs | 212.83.158.154 |
Redirect IPs | 212.83.158.154 |
Response IP | 212.83.158.154 |
Found | Yes |
Hash | 4817d70c79bf5b56afb3be0edba4f153d4c5c3e6d0841d44e6a7497523bef70d |
SimHash | 8b5cdc406610 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /storage/do_xml/id/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.le-gr20.fr/en/sitemap.xml |
sitemap | https://www.le-gr20.fr/sitemap.xml |