awartisan.fr
robots.txt
Robots Exclusion Standard data for awartisan.fr
Resource Scan
Scan Details
| Site Domain | awartisan.fr |
| Base Domain | awartisan.fr |
| Scan Status | Ok |
| Last Scan | 2026-03-01T06:47:31+00:00 |
| Next Scan | 2026-03-31T06:47:31+00:00 |
Last Scan
| Scanned | 2026-03-01T06:47:31+00:00 |
| URL | https://awartisan.fr/robots.txt |
| Redirect | https://www.awartisan.fr/robots.txt |
| Redirect Domain | www.awartisan.fr |
| Redirect Base | awartisan.fr |
| Domain IPs | 104.21.36.17, 172.67.183.18, 2606:4700:3031::ac43:b712, 2606:4700:3034::6815:2411 |
| Redirect IPs | 104.21.36.17, 172.67.183.18, 2606:4700:3031::ac43:b712, 2606:4700:3034::6815:2411 |
| Response IP | 172.67.183.18 |
| Found | Yes |
| Hash | 0baa05fe0f9aa0b2f43eea47c1655aa41aaa85176226eb61ebfc8249034807bf |
| SimHash | 08154f0064d0 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /*.pdf$ |
| Disallow | /return_policy |
| Disallow | /privacy_policy |
| Disallow | /cookies |
| Disallow | /attachment.php* |
| Disallow | /asset_label* |
| Disallow | /page.php* |
| Disallow | /*.sys$ |
| Disallow | /ethics |
| Disallow | /image_root* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.awartisan.fr/sitemaps/es_fr.xml.gz |