crezu.lk
robots.txt
Robots Exclusion Standard data for crezu.lk
Resource Scan
Scan Details
Site Domain | crezu.lk |
Base Domain | crezu.lk |
Scan Status | Ok |
Last Scan | 2024-11-16T14:05:06+00:00 |
Next Scan | 2024-11-23T14:05:06+00:00 |
Last Scan
Scanned | 2024-11-16T14:05:06+00:00 |
URL | https://crezu.lk/robots.txt |
Domain IPs | 35.200.186.84 |
Response IP | 35.200.186.84 |
Found | Yes |
Hash | dfa60dcd8c798c096580f994ab8f81ac58de0637c12cd6e1ba83cc33c650cd55 |
SimHash | 7b4ecc206fd2 |
Groups
*
Rule | Path |
---|---|
Disallow | /main/ |
Disallow | /offers/ |
Disallow | /google/ |
Disallow | /landing* |
Disallow | /landing/offers/ |
Disallow | /landing/offers2/ |
Disallow | /landing/trafficback/ |
Disallow | /landing/rejected/ |
Disallow | /landing/rejected2/ |
Disallow | /landing/offerwall/ |
Disallow | /ofertas-exclusivas-amazon/ |
Disallow | /img/blog//data/img/* |
Other Records
Field | Value |
---|---|
sitemap | https://crezu.lk/sitemap.xml |
Warnings
- `host` is not a known field.