Robots Exclusion Standard data for langthorns.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | langthorns.com |
| Base Domain | langthorns.com |
| Scan Status | Ok |
| Last Scan | 2025-10-12T02:43:16+00:00 |
| Next Scan | 2025-11-11T02:43:16+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-10-12T02:43:16+00:00 |
| URL | https://langthorns.com/robots.txt |
| Domain IPs | 104.21.78.217, 172.67.137.164, 2606:4700:3034::6815:4ed9, 2606:4700:3036::ac43:89a4 |
| Response IP | 104.21.78.217 |
| Found | Yes |
| Hash | 02060b17368e956c1fd7ba2a100dab071468c97b0961e7f57d54c46d1af14b38 |
| SimHash | 0c29ca446993 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /*?page=$ |
| Disallow | /*%26page%3D$ |
| Disallow | /*?sort= |
| Disallow | /*%26sort%3D |
| Disallow | /*?order= |
| Disallow | /*%26order%3D |
| Disallow | /*?limit= |
| Disallow | /*%26limit%3D |
| Disallow | /*?filter_name= |
| Disallow | /*%26filter_name%3D |
| Disallow | /*?filter_sub_category= |
| Disallow | /*%26filter_sub_category%3D |
| Disallow | /*?filter_description= |
| Disallow | /*%26filter_description%3D |
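
The rules above rely on two pattern features: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL. Without `$`, a rule is a prefix match. The following is a minimal sketch of that matching logic, not the scanner's or any crawler's actual implementation. The helper names (`rule_to_regex`, `is_disallowed`) and the sample paths such as `/plants` are hypothetical, and the sketch compares strings literally without the percent-encoding normalization a real crawler may apply.

```python
import re

# Disallow paths copied from the "User-agent: *" group in the table above.
DISALLOW = [
    "/*?page=$", "/*%26page%3D$",
    "/*?sort=", "/*%26sort%3D",
    "/*?order=", "/*%26order%3D",
    "/*?limit=", "/*%26limit%3D",
    "/*?filter_name=", "/*%26filter_name%3D",
    "/*?filter_sub_category=", "/*%26filter_sub_category%3D",
    "/*?filter_description=", "/*%26filter_description%3D",
]


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' means "any characters" and a trailing '$' means
    "end of URL"; everything else is treated as a literal.
    """
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_disallowed(url_path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(url_path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to illustrate the matching behaviour.
    for path in ["/plants", "/plants?sort=price", "/plants?page=", "/plants?page=2"]:
        print(path, is_disallowed(path))
```

Under these assumptions, the `$` on the `page` rules means only URLs that end in an empty `page=` parameter are blocked, so a path like `/plants?page=2` stays crawlable, while any URL whose query string begins with `sort=`, `order=`, `limit=`, or one of the `filter_*` parameters is blocked regardless of the value.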