lawinsider.com
robots.txt
Robots Exclusion Standard data for lawinsider.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | lawinsider.com |
Base Domain | lawinsider.com |
Scan Status | Ok |
Last Scan | 2024-11-13T21:13:27+00:00 |
Next Scan | 2024-11-20T21:13:27+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-13T21:13:27+00:00 |
URL | https://lawinsider.com/robots.txt |
Redirect | https://www.lawinsider.com:443/robots.txt |
Redirect Domain | www.lawinsider.com |
Redirect Base | lawinsider.com |
Domain IPs | 2600:1901:0:142c::, 34.96.69.209 |
Redirect IPs | 2600:1901:0:142c::, 34.96.69.209 |
Response IP | 34.96.69.209 |
Found | Yes |
Hash | a7f410a9fb8b2f680c01a104580a80596eec9d2b18384480f4d41f7ddd56877b |
SimHash | cb117850e712 |
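
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm, so SHA-256 is an assumption here, as is the idea that the digest is taken over the raw response body. Under those assumptions, a locally fetched copy of the file could be compared against the scan roughly like this (`sha256_hex` and `matches_reported` are hypothetical helper names):

```python
import hashlib

# Hash value copied from the scan record above.
REPORTED = "a7f410a9fb8b2f680c01a104580a80596eec9d2b18384480f4d41f7ddd56877b"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """True if the body hashes to the value in the scan record.

    Assumes the scanner hashed the unmodified response body with SHA-256;
    the report itself does not confirm either point.
    """
    return sha256_hex(body) == REPORTED
```

A mismatch would not necessarily mean the file changed; it could also mean the scanner uses a different algorithm or normalizes the content before hashing.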
Groups
*
Rule | Path |
---|---|
Disallow | /contracts/tagged/*%2B* |
Disallow | /contracts/tagged/*%2B* |
Disallow | /signin* |
Disallow | /signup* |
Disallow | /user/* |
Disallow | /api/search |
Disallow | /contracts/*.pdf$ |
Disallow | /contracts/*.docx$ |
Disallow | /contracts/*.drive$ |
Disallow | /clause/*.pdf$ |
Disallow | /clause/*.docx$ |
Disallow | /clause/*.drive$ |
Disallow | /search* |
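
The paths above use the two special characters common in robots.txt rules: `*` for any run of characters and a trailing `$` to anchor the end of the URL path. Python's standard `urllib.robotparser` does not interpret these wildcards, so the sketch below translates each pattern into a regular expression instead. It is a simplified illustration, not the scanner's or any crawler's actual matcher: it ignores percent-encoding normalization, and because this group has no Allow rules it skips longest-match precedence. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical.

```python
import re

# Disallow patterns copied from the "*" group above (the duplicate listed once).
DISALLOW = [
    "/contracts/tagged/*%2B*",
    "/signin*",
    "/signup*",
    "/user/*",
    "/api/search",
    "/contracts/*.pdf$",
    "/contracts/*.docx$",
    "/contracts/*.drive$",
    "/clause/*.pdf$",
    "/clause/*.docx$",
    "/clause/*.drive$",
    "/search*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches the path from its start."""
    return any(rule.match(path) for rule in RULES)
```

For example, `is_disallowed("/contracts/abc.pdf")` is true, while `is_disallowed("/contracts/abc.pdf?x=1")` is false because the `$` anchor requires the path to end in `.pdf`. Rules without `$`, such as `/api/search`, act as prefixes, so `/api/search/x` is also blocked.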
Other Records
Field | Value |
---|---|
sitemap | https://www.lawinsider.com/sitemap.xml |