varmatin.com
robots.txt
Robots Exclusion Standard data for varmatin.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | varmatin.com |
Base Domain | varmatin.com |
Scan Status | Ok |
Last Scan | 2024-06-26T06:55:45+00:00 |
Next Scan | 2024-07-03T06:55:45+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-26T06:55:45+00:00 |
URL | https://varmatin.com/robots.txt |
Redirect | https://www.varmatin.com/robots.txt |
Redirect Domain | www.varmatin.com |
Redirect Base | varmatin.com |
Domain IPs | 80.94.98.229, 80.94.98.231 |
Redirect IPs | 80.94.98.229, 80.94.98.231 |
Response IP | 80.94.98.231 |
Found | Yes |
Hash | 57ab5007e22eecf6f112a787443a467ef93cc773d6703f52b5e174081776ef59 |
SimHash | 4a84583549a3 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /recherche?search=* |
Disallow | /oa |
Disallow | /user* |
Disallow | /a/ |
Disallow | /edition-du-jour/lire |
Disallow | /auth/ |
Disallow | /index.php/* |
Disallow | /*/get-token* |
Disallow | /*/oaToken/* |
Disallow | /newspapers/read/* |
Disallow | /carnet-avis-deces* |
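Several of the paths above use the `*` wildcard, which matches any run of characters, and every rule is matched as a prefix of the requested path. The following is a minimal sketch of that matching in Python, assuming the wildcard semantics described in RFC 9309. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical helpers introduced for illustration and are not part of the scan data. Because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths copied from the rule table above.
DISALLOW = [
    "/recherche?search=*",
    "/oa",
    "/user*",
    "/a/",
    "/edition-du-jour/lire",
    "/auth/",
    "/index.php/*",
    "/*/get-token*",
    "/*/oaToken/*",
    "/newspapers/read/*",
    "/carnet-avis-deces*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' matches any run of characters; a trailing '$' anchors the end of the
    path. Everything else is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/user/login", "/api/get-token", "/actualites/faits-divers"]:
        print(candidate, "->", "disallowed" if is_disallowed(candidate) else "allowed")
```

Note that prefix matching means `/oa` also covers longer paths such as `/oauth`, and a trailing `*` (as in `/user*`) does not change what a rule matches.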
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.varmatin.com/sitemap.xml |
sitemap | https://www.varmatin.com/googlenews.xml |
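The crawl-delay and sitemap records can be read back with Python's standard `urllib.robotparser`. The sketch below assumes a reconstructed file: `ROBOTS_TXT` is assembled from the records listed above, includes only two of the Disallow rules, and its line order is a guess rather than the layout of the live file. `urllib.robotparser` matches paths by plain prefix and does not interpret `*` wildcards, so it is used here only for the non-path records. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records; the ordering is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /oa
Disallow: /auth/
Crawl-delay: 10

Sitemap: https://www.varmatin.com/sitemap.xml
Sitemap: https://www.varmatin.com/googlenews.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

# Delay (in seconds) requested for the catch-all user agent.
delay = parser.crawl_delay("*")

# Sitemap URLs declared in the file, in order of appearance.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print("crawl-delay:", delay)
    for url in sitemaps or []:
        print("sitemap:", url)
```

A polite crawler would wait at least `delay` seconds between requests to this host and could use the sitemap URLs as its starting points.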