wamiz.it
robots.txt

Robots Exclusion Standard data for wamiz.it

Resource Scan

Scan Details

Site Domain wamiz.it
Base Domain wamiz.it
Scan Status Ok
Last Scan 2024-11-16T12:56:40+00:00
Next Scan 2024-11-23T12:56:40+00:00

Last Scan

Scanned 2024-11-16T12:56:40+00:00
URL https://wamiz.it/robots.txt
Domain IPs 104.18.28.33, 104.18.29.33, 2606:4700::6812:1c21, 2606:4700::6812:1d21
Response IP 104.18.29.33
Found Yes
Hash 18c2ec614fec41433e11d3cc93b464f17ad114a86b7b71cc076c5984d0caeb31
SimHash eb14ce4baaf5

Groups

*

Rule Path
Disallow /*?preview=*
Disallow /*?_website*
Disallow /_fragment?*
Disallow /cdn-cgi/*
Disallow /_monitoring/*
Disallow /api/event
Disallow /amazon/publisher-audiences/clear-cookies
Disallow /*?filter_form*
Disallow /*radius%3D
Disallow /*amp%3Bamp
Disallow /*?order=
Disallow /iscrizione
Disallow /login?
Disallow /profilo/
Disallow /autore/*/*/2
Disallow /tags/
Disallow /esi/*
Disallow /api/tracking/ab_status_collect
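
The rules above can be checked against a URL path with a small matcher. The following is a minimal sketch, not the crawler logic of any particular search engine. It assumes the common wildcard semantics (a pattern matches as a prefix of the path plus query string, `*` matches any run of characters, a trailing `$` anchors the end) and skips percent-encoding normalization. The function names `pattern_to_regex` and `is_disallowed`, and the sample paths in the usage note, are hypothetical.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/*?preview=*",
    "/*?_website*",
    "/_fragment?*",
    "/cdn-cgi/*",
    "/_monitoring/*",
    "/api/event",
    "/amazon/publisher-audiences/clear-cookies",
    "/*?filter_form*",
    "/*radius%3D",
    "/*amp%3Bamp",
    "/*?order=",
    "/iscrizione",
    "/login?",
    "/profilo/",
    "/autore/*/*/2",
    "/tags/",
    "/esi/*",
    "/api/tracking/ab_status_collect",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule.

    re.match anchors at the start of the string, which gives the
    prefix-match behaviour assumed above.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

For example, under these assumptions `is_disallowed("/tags/gatti")` and `is_disallowed("/cani/razze?order=asc")` both return `True`, while `is_disallowed("/cani/razze")` returns `False`. Note that `/login?` only blocks login URLs that carry a query string, so `is_disallowed("/login")` is `False`. The group contains no Allow rules, so no longest-match tie-breaking is needed here.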

Other Records

Field Value
sitemap https://wamiz.it/sitemap.xml