trovacuccioli.com
robots.txt

Robots Exclusion Standard data for trovacuccioli.com

Resource Scan

Scan Details

Site Domain trovacuccioli.com
Base Domain trovacuccioli.com
Scan Status Ok
Last Scan2024-09-26T18:00:10+00:00
Next Scan 2024-10-03T18:00:10+00:00

Last Scan

Scanned2024-09-26T18:00:10+00:00
URL https://trovacuccioli.com/robots.txt
Domain IPs 2001:8d8:100f:f000::29d, 217.160.0.126
Response IP 217.160.0.126
Found Yes
Hash cb787ef4cfb13a320e17c06f347603025b5bfa793f0974f05788479231bfd9bd
SimHash 25c3395055db

Groups

*

Rule Path
Disallow
Disallow /?view=showad&adid=*&cityid=*&reported=y
Disallow /?view=showad&adid=*&cityid=*&do=reportabuse
Disallow /cgi-bin/
Disallow /admin/
Disallow /cron/
Disallow /js/
Disallow /css/
Disallow /images/
Disallow /lang/
Disallow /log/
Disallow /mailtemplates/
Disallow /theme/
Disallow /_tumbs/
Disallow /userimgs/
Disallow /?view=mailad*