cwit.lk
robots.txt
Robots Exclusion Standard data for cwit.lk
Resource Scan
Scan Details
Site Domain | cwit.lk |
Base Domain | cwit.lk |
Scan Status | Ok |
Last Scan | 2025-09-17T01:05:51+00:00 |
Next Scan | 2025-10-17T01:05:51+00:00 |
Last Scan
Scanned | 2025-09-17T01:05:51+00:00 |
URL | https://www.cwit.lk/robots.txt |
Domain IPs | 2600:9000:2721:3200:11:cd49:2700:93a1, 2600:9000:2721:3400:11:cd49:2700:93a1, 2600:9000:2721:4000:11:cd49:2700:93a1, 2600:9000:2721:600:11:cd49:2700:93a1, 2600:9000:2721:a200:11:cd49:2700:93a1, 2600:9000:2721:ae00:11:cd49:2700:93a1, 2600:9000:2721:d200:11:cd49:2700:93a1, 2600:9000:2721:d600:11:cd49:2700:93a1, 3.165.102.10, 3.165.102.35, 3.165.102.40, 3.165.102.8 |
Response IP | 3.165.102.35 |
Found | Yes |
Hash | 5286d4d92cbf9a442109d919622607f9958b5b708c2f6e469c4b675fa15e68d2 |
SimHash | a84f8f04c913 |
Groups
*
Rule | Path |
---|---|
Disallow | /404.html |
Disallow | /messages.html |
Disallow | *.php |
Disallow | /reviews-form.html |
Disallow | /sitemap-org.xml |
Other Records
Field | Value |
---|---|
sitemap | https://www.cwit.lk/sitemap.xml |
sitemap | https://www.cwit.lk/images_sitemap.xml |