interestingengineering.com
robots.txt
Robots Exclusion Standard data for interestingengineering.com
Resource Scan
Scan Details
Site Domain | interestingengineering.com |
Base Domain | interestingengineering.com |
Scan Status | Ok |
Last Scan | 2024-06-08T10:32:57+00:00 |
Next Scan | 2024-06-15T10:32:57+00:00 |
Last Scan
Scanned | 2024-06-08T10:32:57+00:00 |
URL | https://interestingengineering.com/robots.txt |
Domain IPs | 104.26.14.179, 104.26.15.179, 172.67.75.65, 2606:4700:20::681a:eb3, 2606:4700:20::681a:fb3, 2606:4700:20::ac43:4b41 |
Response IP | 104.26.15.179 |
Found | Yes |
Hash | d1c280c70ad011a969aad2833b66a04142b8de8b9c4898758b594b619aa9c539 |
SimHash | 498c4132a751 |
Groups
*
Rule | Path |
---|---|
Disallow | /s/* |
Disallow | /redir/* |
Disallow | /newsletter/* |
Disallow | /partial/* |
Disallow | /*?context_item_id |
Other Records
Field | Value |
---|---|
sitemap | https://interestingengineering.com/sitemap_index.xml |
sitemap | https://interestingengineering.com/news-sitemap.xml |
Warnings
- 1 invalid line.