ilucca.net
robots.txt

Robots Exclusion Standard data for ilucca.net

Resource Scan

Scan Details

Site Domain ilucca.net
Base Domain ilucca.net
Scan Status Ok
Last Scan4/30/2025, 8:12:10 PM
Next Scan 5/30/2025, 8:12:10 PM

Last Scan

Scanned4/30/2025, 8:12:10 PM
URL http://ilucca.net/robots.txt
Redirect https://www.lucca.fr/robots.txt
Redirect Domain www.lucca.fr
Redirect Base lucca.fr
Domain IPs 213.186.33.5
Redirect IPs 213.186.33.24
Response IP 213.186.33.24
Found Yes
Hash 81b92c5881e6564878bcb9abcf758e12fa9a072c091de179bc148edf78edb0c0
SimHash 54447840c111

Groups

*

Rule Path
Disallow /test-vp.html
Disallow /magazine/wp-content/
Disallow /magazine/wp-admin/
Disallow /magazine/wp-includes/
Disallow /filter_by%3Dpopular
Disallow /filter_by%3Drandom_posts
Disallow /filter_by%3Dpopular7
Disallow /filter_by%3Dfeatured
Disallow /filter_by%3Dreview_high
Disallow /blog/wp-content/
Disallow /blog/wp-admin/
Disallow /blog/wp-includes/
Disallow /newsletter/
Disallow /404
Disallow /?

Other Records

Field Value
sitemap https://www.lucca.fr/sitemap.xml