nutridome.it
robots.txt

Robots Exclusion Standard data for nutridome.it

Resource Scan

Scan Details

Site Domain nutridome.it
Base Domain nutridome.it
Scan Status Ok
Last Scan2025-07-01T22:27:14+00:00
Next Scan 2025-07-15T22:27:14+00:00

Last Scan

Scanned2025-07-01T22:27:14+00:00
URL https://nutridome.it/robots.txt
Domain IPs 104.26.12.166, 104.26.13.166, 172.67.75.230, 2606:4700:20::681a:ca6, 2606:4700:20::681a:da6, 2606:4700:20::ac43:4be6
Response IP 104.26.12.166
Found Yes
Hash fc42c62b3f7b9669a27d3a6492f6e55470e1a56386bc1b6359601a18ae68bdc0
SimHash 0d45d9c047f4

Groups

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

adsbot-google-image

Rule Path
Allow /

*

Rule Path
Disallow /*?sort=
Disallow /*?brand=
Disallow /*?promotion=
Disallow /*%26brand%3D
Disallow /*%26show%3D
Disallow /*%26q%3D

googlebot

Rule Path
Disallow /*?sort=
Disallow /*?brand=
Disallow /*?promotion=
Disallow /*%26brand%3D
Disallow /*%26show%3D
Disallow /*%26q%3D

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

httrack

Rule Path
Disallow /

wget

Rule Path
Disallow /