trovaricetta.com
robots.txt

Robots Exclusion Standard data for trovaricetta.com

Resource Scan

Scan Details

Site Domain trovaricetta.com
Base Domain trovaricetta.com
Scan Status Ok
Last Scan2024-11-12T16:41:55+00:00
Next Scan 2024-11-19T16:41:55+00:00

Last Scan

Scanned2024-11-12T16:41:55+00:00
URL https://trovaricetta.com/robots.txt
Redirect https://www.trovaricetta.com/robots.txt
Redirect Domain www.trovaricetta.com
Redirect Base trovaricetta.com
Domain IPs 104.26.2.120, 104.26.3.120, 172.67.75.3, 2606:4700:20::681a:278, 2606:4700:20::681a:378, 2606:4700:20::ac43:4b03
Redirect IPs 104.26.2.120, 104.26.3.120, 172.67.75.3, 2606:4700:20::681a:278, 2606:4700:20::681a:378, 2606:4700:20::ac43:4b03
Response IP 104.26.3.120
Found Yes
Hash 6410ad875027476339b80b59d3c0f2fc6a0e479bf9afb767131a2252357c783c
SimHash e01559184f13

Groups

*

Rule Path
Disallow /ricetta/*
Disallow /app/*

sitebot

Rule Path
Disallow /

baidu

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Comments

  • too many repeated hits, too quick
  • too many repeated hits, too quick
  • too many repeated hits, too quick