webmaal.in
robots.txt
Robots Exclusion Standard data for webmaal.in
Resource Scan
Scan Details
| Site Domain | webmaal.in |
| Base Domain | webmaal.in |
| Scan Status | Ok |
| Last Scan | 2026-01-10T14:25:13+00:00 |
| Next Scan | 2026-02-09T14:25:13+00:00 |
Last Scan
| Scanned | 2026-01-10T14:25:13+00:00 |
| URL | https://webmaal.in/robots.txt |
| Redirect | https://xo.webmaal.in/robots.txt |
| Redirect Domain | xo.webmaal.in |
| Redirect Base | webmaal.in |
| Domain IPs | 104.21.8.44, 172.67.156.212, 2606:4700:3033::ac43:9cd4, 2606:4700:3037::6815:82c |
| Redirect IPs | 104.21.8.44, 172.67.156.212, 2606:4700:3033::ac43:9cd4, 2606:4700:3037::6815:82c |
| Response IP | 104.21.8.44 |
| Found | Yes |
| Hash | 7fdb7975cda0d0e55267767f40a8fce7504a765291e64d47bd7abdf79f4dc64e |
| SimHash | 46350b53cdd4 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /search |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://aagmaal.tax/sitemap.xml |
Warnings
- `content-signal` is not a known field.
Comments