readfrom.net
robots.txt
Robots Exclusion Standard data for readfrom.net
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | readfrom.net |
Base Domain | readfrom.net |
Scan Status | Ok |
Last Scan | 2024-11-11T07:49:30+00:00 |
Next Scan | 2024-11-18T07:49:30+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-11T07:49:30+00:00 |
URL | https://readfrom.net/robots.txt |
Domain IPs | 101.99.94.14 |
Response IP | 101.99.94.14 |
Found | Yes |
Hash | dd138fab2dcee245713baff8ef135252c722d0b03e7c2726c976f8a30c94f70e |
SimHash | 7908bc72c733 |
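The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. A minimal sketch for checking a locally saved copy of the file, assuming the hash is SHA-256 over the raw response body (the helper names `sha256_hex` and `matches_report` are hypothetical):

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "dd138fab2dcee245713baff8ef135252c722d0b03e7c2726c976f8a30c94f70e"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a robots.txt body against the reported hash.

    Assumption: the report hashes the raw, unmodified response body
    with SHA-256. If the scanner normalizes the file first, this
    comparison will fail even for an unchanged file.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would only suggest that the file changed since the scan or that the assumption about the hashing scheme is wrong.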
Groups
*
Rule | Path |
---|---|
Disallow | /engine/go.php |
Disallow | /engine/download.php |
Disallow | /user/ |
Disallow | /newposts/ |
Disallow | /statistics.html |
Disallow | /*subaction%3Duserinfo |
Disallow | /*subaction%3Dnewposts |
Disallow | /*do%3Dlastcomments |
Disallow | /*do%3Dfeedback |
Disallow | /*do%3Dlostpassword |
Disallow | /*do%3Daddnews |
Disallow | /*do%3Dstats |
Disallow | /*do%3Dpm |
Disallow | /*do%3Dsearch |
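Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not expand. The following is a minimal sketch of a matcher for this rule group, where `*` matches any run of characters and a trailing `$` anchors the end of the path. It compares paths literally against the patterns exactly as the report displays them; whether the original file contains `=` or `%3D` is not known from the report, so a request path containing a literal `=` would not match here. The names `pattern_to_regex` and `is_disallowed` are hypothetical.

```python
import re

# Disallow paths for the "*" group, copied as displayed in the report.
DISALLOW = [
    "/engine/go.php",
    "/engine/download.php",
    "/user/",
    "/newposts/",
    "/statistics.html",
    "/*subaction%3Duserinfo",
    "/*subaction%3Dnewposts",
    "/*do%3Dlastcomments",
    "/*do%3Dfeedback",
    "/*do%3Dlostpassword",
    "/*do%3Daddnews",
    "/*do%3Dstats",
    "/*do%3Dpm",
    "/*do%3Dsearch",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each "*" into ".*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path.

    The group has no Allow rules, so no longest-match tie-breaking
    is needed in this sketch.
    """
    return any(rx.match(path) is not None for rx in RULES)
```

For example, `is_disallowed("/user/42")` is true because of the `/user/` prefix rule, while a path such as `/books/1.html` matches none of the rules.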
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
sitemap | https://readfrom.net/sitemap.xml |
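The crawl-delay and sitemap records can be read with Python's standard `urllib.robotparser`. The sketch below parses a reconstructed fragment rather than the real file: the text in `RECONSTRUCTED` is an assumption built from this report, including the placement of `Crawl-delay` inside the `*` group, and it contains only one of the Disallow rules. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction from the report, not the actual
# contents of https://readfrom.net/robots.txt.
RECONSTRUCTED = """\
User-agent: *
Disallow: /user/
Crawl-delay: 1
Sitemap: https://readfrom.net/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED.splitlines())

# Seconds a crawler is asked to wait between requests for this agent.
delay = parser.crawl_delay("*")

# List of sitemap URLs declared in the file, or None if there are none.
sitemaps = parser.site_maps()

# Plain prefix rules such as /user/ are handled by the standard parser.
user_page_allowed = parser.can_fetch("*", "https://readfrom.net/user/1")
```

A polite crawler would sleep for `delay` seconds between requests and could use the URLs in `sitemaps` as its starting point for discovery.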
Warnings
- `host` is not a known field.