inmaterassi.it
robots.txt

Robots Exclusion Standard data for inmaterassi.it

Resource Scan

Scan Details

Site Domain inmaterassi.it
Base Domain inmaterassi.it
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-05T17:50:52+00:00
Next Scan 2025-10-12T17:50:52+00:00
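
The Last Scan and Next Scan timestamps are exactly seven days apart, which suggests (but does not confirm) a weekly rescan schedule. A minimal check of that arithmetic, using only the two values shown in this report:

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields of this report.
last_scan = datetime.fromisoformat("2025-10-05T17:50:52+00:00")
next_scan = datetime.fromisoformat("2025-10-12T17:50:52+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 7
```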

Last Successful Scan

Scanned 2025-09-04T08:55:55+00:00
URL https://inmaterassi.it/robots.txt
Redirect https://www.inmaterassi.it/it/robots.txt
Redirect Domain www.inmaterassi.it
Redirect Base inmaterassi.it
Domain IPs 185.56.218.19
Redirect IPs 185.56.218.19
Response IP 185.56.218.19
Found Yes
Hash 3aa79c0ad50d5e8ad3374a21449de5ae7bb891e3fabbc78c55060f226b2702ac
SimHash 1814c86768d3

Groups

*

Rule Path
Allow /modules/.css
Allow /modules/.js
Allow /modules/.png
Allow /modules/.jpg
Allow /js/jquery/*

*

Rule Path
Disallow /*?timestamp=
Disallow */?timestamp=
Disallow */?fbclid=
Disallow */?gclid=
Disallow */?utm_source=
Disallow */?utm_content=
Disallow */?s
Disallow */?_gl=
Disallow /*?order=
Disallow /*%26order%3D
Disallow /*?resultsPerPage=
Disallow /*%26resultsPerPage%3D
Disallow /*?selected_filters=
Disallow /*%26selected_filters%3D
Disallow /*?token=
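
As a rough illustration of how wildcard paths like the ones above are usually interpreted, the sketch below assumes the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` and the sample path `/it/materassi` are hypothetical, and the sketch does no percent-encoding normalization, so a pattern such as `/*%26order%3D` only matches paths containing that literal encoded text.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters and a
    trailing '$' anchors the end. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

# A subset of the Disallow paths listed in the group above.
DISALLOW = ["/*?order=", "/*%26order%3D", "/*?resultsPerPage=", "/*?token="]

# Hypothetical paths, used only to exercise the matcher.
print(is_disallowed("/it/materassi?order=product.price.asc", DISALLOW))  # True
print(is_disallowed("/it/materassi", DISALLOW))                          # False
```

A real crawler would also weigh matching Allow rules against Disallow rules (typically the longest match wins); this sketch checks Disallow patterns only.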

facebookexternalhit

Rule Path
Disallow /

facebookexternalhit/1.1

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow
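
The three agent-specific groups above differ in an easily missed way: `Disallow /` blocks the whole site, while a `Disallow` with an empty path blocks nothing. The sketch below checks this with Python's standard `urllib.robotparser`. The robots.txt text is retyped from the groups in this report (the original file is not reproduced here, so the exact syntax is an assumption), and the sample URL is used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agent-specific groups retyped from the report in robots.txt syntax.
ROBOTS_TXT = """\
User-agent: facebookexternalhit
Disallow: /

User-agent: facebookexternalhit/1.1
Disallow: /

User-agent: googlebot-image
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

url = "https://www.inmaterassi.it/it/"  # sample URL for illustration

# 'Disallow: /' blocks everything for the Facebook crawler.
print(parser.can_fetch("facebookexternalhit/1.1", url))  # False

# An empty Disallow places no restriction on googlebot-image.
print(parser.can_fetch("googlebot-image", url))  # True
```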

Other Records

Field Value
sitemap https://www.inmaterassi.it/it/1_index_sitemap.xml

Comments

  • Allow Directives
  • Disallow Directives
  • Sitemap