mia.org.my
robots.txt
Robots Exclusion Standard data for mia.org.my
Resource Scan
Scan Details
| Site Domain | mia.org.my |
| Base Domain | mia.org.my |
| Scan Status | Ok |
| Last Scan | 2026-01-22T06:16:46+00:00 |
| Next Scan | 2026-02-21T06:16:46+00:00 |
Last Scan
| Scanned | 2026-01-22T06:16:46+00:00 |
| URL | https://mia.org.my/robots.txt |
| Domain IPs | 104.26.12.139, 104.26.13.139, 172.67.74.84, 2606:4700:20::681a:c8b, 2606:4700:20::681a:d8b, 2606:4700:20::ac43:4a54 |
| Response IP | 172.67.74.84 |
| Found | Yes |
| Hash | a9ca965126620991bac6fba8445a21a6b5f3837765179f945b275158b20ff791 |
| SimHash | 44354913cd54 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /*blackhole |
| Disallow | /?blackhole |
*
| Rule | Path |
|---|---|
| Disallow |
Other Records
| Field | Value |
|---|---|
| sitemap | https://mia.org.my/sitemap_index.xml |
Warnings
- `content-signal` is not a known field.
Comments