ilucca.net
robots.txt
Robots Exclusion Standard data for ilucca.net
Resource Scan
Scan Details
Site Domain | ilucca.net |
Base Domain | ilucca.net |
Scan Status | Ok |
Last Scan | 4/30/2025, 8:12:10 PM |
Next Scan | 5/30/2025, 8:12:10 PM |
Last Scan
Scanned | 4/30/2025, 8:12:10 PM |
URL | http://ilucca.net/robots.txt |
Redirect | https://www.lucca.fr/robots.txt |
Redirect Domain | www.lucca.fr |
Redirect Base | lucca.fr |
Domain IPs | 213.186.33.5 |
Redirect IPs | 213.186.33.24 |
Response IP | 213.186.33.24 |
Found | Yes |
Hash | 81b92c5881e6564878bcb9abcf758e12fa9a072c091de179bc148edf78edb0c0 |
SimHash | 54447840c111 |
Groups
*
Rule | Path |
---|---|
Disallow | /test-vp.html |
Disallow | /magazine/wp-content/ |
Disallow | /magazine/wp-admin/ |
Disallow | /magazine/wp-includes/ |
Disallow | /filter_by%3Dpopular |
Disallow | /filter_by%3Drandom_posts |
Disallow | /filter_by%3Dpopular7 |
Disallow | /filter_by%3Dfeatured |
Disallow | /filter_by%3Dreview_high |
Disallow | /blog/wp-content/ |
Disallow | /blog/wp-admin/ |
Disallow | /blog/wp-includes/ |
Disallow | /newsletter/ |
Disallow | /404 |
Disallow | /? |
Other Records
Field | Value |
---|---|
sitemap | https://www.lucca.fr/sitemap.xml |