nomix.it
robots.txt

Robots Exclusion Standard data for nomix.it

Resource Scan

Scan Details

Site Domain nomix.it
Base Domain nomix.it
Scan Status Ok
Last Scan2024-12-21T03:52:27+00:00
Next Scan 2024-12-28T03:52:27+00:00

Last Scan

Scanned2024-12-21T03:52:27+00:00
URL https://nomix.it/robots.txt
Redirect https://www.nomix.it/robots.txt
Redirect Domain www.nomix.it
Redirect Base nomix.it
Domain IPs 46.4.120.136
Redirect IPs 46.4.120.136
Response IP 46.4.120.136
Found Yes
Hash 063dedbe199077c2926f21199ecd8e26aa434165af3433a13a79c855dcb49aec
SimHash 66185840cd92

Groups

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

*

Rule Path
Disallow /gestione/
Disallow /cnt/
Disallow /elementi/
Disallow /doubleclick/
Disallow /risultati-della-ricerca.php

grapeshot

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.nomix.it/sitemap.xml
sitemap https://www.nomix.it/sitemap-onomastici.xml
sitemap https://www.nomix.it/sitemap-mappe.xml.gz

Comments

  • robots.txt