kathysteinemann.com
robots.txt
Robots Exclusion Standard data for kathysteinemann.com
Resource Scan
Scan Details
Site Domain | kathysteinemann.com |
Base Domain | kathysteinemann.com |
Scan Status | Ok |
Last Scan | 2024-10-29T20:40:59+00:00 |
Next Scan | 2024-11-28T20:40:59+00:00 |
Last Scan
Scanned | 2024-10-29T20:40:59+00:00 |
URL | https://kathysteinemann.com/robots.txt |
Domain IPs | 173.236.139.243 |
Response IP | 173.236.139.243 |
Found | Yes |
Hash | efd03825b7852bcea7dbb41b14fa17a95f0f473fc3d97acbcb71a7c6012f7800 |
SimHash | 43e9784191a1 |
Groups
*
Rule | Path |
---|---|
Disallow | /*admin |
Disallow | /*feed |
Disallow | /*.asp$ |
Disallow | /cgi-bin/ |
Disallow | /labels.rdf |
Disallow | /*/author/ |
Disallow | /*/tag/ |
Disallow | /.well-known |
Disallow | /*?replytocom=* |
Disallow | /*?share=* |
Disallow | /*?ak_action=* |
Disallow | /*?p=* |
Disallow | /*?captcha_code=* |
Disallow | /*/wp-json/oembed/ |
Disallow | /*submit%3D |
Disallow | /*/contact* |
Disallow | /*/page/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Warnings
- `user agent` is not a known field.