littlegreenfootballs.com
robots.txt
Robots Exclusion Standard data for littlegreenfootballs.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | littlegreenfootballs.com |
Base Domain | littlegreenfootballs.com |
Scan Status | Ok |
Last Scan | 2024-10-19T06:03:21+00:00 |
Next Scan | 2024-10-26T06:03:21+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-19T06:03:21+00:00 |
URL | https://littlegreenfootballs.com/robots.txt |
Domain IPs | 216.170.124.124 |
Response IP | 216.170.124.124 |
Found | Yes |
Hash | d8fa5a5b4f410cdf26402cad93065e9e49209b332cfcbcfc09fc887214d72222 |
SimHash | f4179b162010 |
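The Hash value is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the unmodified response body. The helper names `sha256_hex` and `matches_recorded_hash` are hypothetical and are not part of the scanner.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "d8fa5a5b4f410cdf26402cad93065e9e49209b332cfcbcfc09fc887214d72222"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a fetched body against a recorded digest.

    Assumption: the scanner hashes the raw response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

A caller would pass the bytes fetched from the URL listed in the record. A mismatch could mean either that the file changed since the scan or that the assumption about the algorithm is wrong.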
Groups
turnitinbot
Rule | Path |
---|---|
Disallow | / |
Disallow | /weblog/spam_hole/ |
Disallow | /weblog/lgf-search-requests.php |
Disallow | /weblog/lgf-tagstorm.php |
Disallow | /weblog/lgf-user-manage.php |
Disallow | /weblog/lgf-getconfirm.php |
Disallow | /weblog/lgf-favorites.php |
Disallow | /weblog/lgf-feeds.php |
Disallow | /weblog/lgf-subscription-thanks.php |
Disallow | /user/ |
Disallow | /comment/ |
Disallow | /print/ |
Disallow | /tag/ |
Disallow | /day/ |
Disallow | /spy/ |
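To see how a crawler might interpret this group, the rules can be fed to Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the table, assuming every listed rule belongs to the single `turnitinbot` group exactly as shown. The real file's layout may differ, and `ExampleBot` is a made-up agent name.

```python
from urllib.robotparser import RobotFileParser

BASE = "https://littlegreenfootballs.com"

# Reconstructed from the table above; the exact layout of the real file is an assumption.
ROBOTS_LINES = [
    "User-agent: turnitinbot",
    "Disallow: /",
    "Disallow: /weblog/spam_hole/",
    "Disallow: /weblog/lgf-search-requests.php",
    "Disallow: /weblog/lgf-tagstorm.php",
    "Disallow: /weblog/lgf-user-manage.php",
    "Disallow: /weblog/lgf-getconfirm.php",
    "Disallow: /weblog/lgf-favorites.php",
    "Disallow: /weblog/lgf-feeds.php",
    "Disallow: /weblog/lgf-subscription-thanks.php",
    "Disallow: /user/",
    "Disallow: /comment/",
    "Disallow: /print/",
    "Disallow: /tag/",
    "Disallow: /day/",
    "Disallow: /spy/",
]


def build_parser(lines):
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)

# turnitinbot is matched by the group above.
turnitin_allowed = parser.can_fetch("turnitinbot", BASE + "/weblog/")

# ExampleBot (hypothetical) matches no group in this reconstruction.
other_allowed = parser.can_fetch("ExampleBot", BASE + "/tag/example")
```

Under this reconstruction, `Disallow: /` already covers every path for `turnitinbot`, so the more specific rules add nothing for that agent. An agent with no matching group is allowed by default.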
Other Records
Field | Value |
---|---|
sitemap | https://littlegreenfootballs.com/weblog/sitemaps/sitemap-index.xml |
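The sitemap record sits outside any user-agent group, so it applies regardless of crawler. As a minimal sketch, assuming the file carries it as a standard `Sitemap:` line, `urllib.robotparser` (Python 3.8 or later) exposes such entries through `site_maps()`. The helper `sitemap_urls` is a hypothetical name.

```python
from urllib.robotparser import RobotFileParser

SITEMAP_URL = "https://littlegreenfootballs.com/weblog/sitemaps/sitemap-index.xml"


def sitemap_urls(lines):
    """Return the Sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


# Assumption: the record appears in the file as a plain "Sitemap:" line.
found = sitemap_urls(["Sitemap: " + SITEMAP_URL])
```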