wamiz.com
robots.txt
Robots Exclusion Standard data for wamiz.com
Resource Scan
Scan Details
Site Domain | wamiz.com |
Base Domain | wamiz.com |
Scan Status | Ok |
Last Scan | 2024-06-02T15:36:07+00:00 |
Next Scan | 2024-06-09T15:36:07+00:00 |
Last Scan
Scanned | 2024-06-02T15:36:07+00:00 |
URL | https://wamiz.com/robots.txt |
Domain IPs | 104.18.8.37, 104.18.9.37, 2606:4700::6812:825, 2606:4700::6812:925 |
Response IP | 104.18.9.37 |
Found | Yes |
Hash | 392c4b60126d17420c09754f0284ff5d25b78783824054577d40b9a4a05b796d |
SimHash | 7355010befd1 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?preview=* |
Disallow | /*?_website* |
Disallow | /_fragment?* |
Disallow | /cdn-cgi/* |
Disallow | /_monitoring/* |
Disallow | /api/event |
Disallow | /amazon/publisher-audiences/clear-cookies |
Disallow | /*?filter_form* |
Disallow | /*radius%3D |
Disallow | /*amp%3Bamp |
Disallow | /*?order= |
Disallow | /inscription |
Disallow | /login? |
Disallow | /profil/ |
Disallow | /auteur/*/*/2 |
Disallow | /tags/ |
Disallow | /esi/* |
Disallow | /redirect/ |
Disallow | /user/ajaxLogin |
Disallow | /facebook/login |
Disallow | /*018173665941857647736 |
Other Records
Field | Value |
---|---|
sitemap | https://wamiz.com/sitemap.xml |