manunggaljaya-inhil.desa.id
robots.txt

Robots Exclusion Standard data for manunggaljaya-inhil.desa.id

Resource Scan

Scan Details

Site Domain manunggaljaya-inhil.desa.id
Base Domain manunggaljaya-inhil.desa.id
Scan Status Ok
Last Scan2025-10-29T14:27:43+00:00
Next Scan 2025-11-28T14:27:43+00:00

Last Scan

Scanned2025-10-29T14:27:43+00:00
URL https://www.manunggaljaya-inhil.desa.id/robots.txt
Domain IPs 142.251.10.121, 2404:6800:4003:c00::79
Response IP 74.125.68.121
Found Yes
Hash 833a25cbf60da43cd496e9814f3597223ae2f3cbcaaa86f50dbb821447591b96
SimHash 4954d2505f53

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap https://www.manunggaljaya-inhil.desa.id/sitemap.xml