maithili.org.in
robots.txt

Robots Exclusion Standard data for maithili.org.in

Resource Scan

Scan Details

Site Domain maithili.org.in
Base Domain maithili.org.in
Scan Status Ok
Last Scan 2025-12-03T19:18:32+00:00
Next Scan 2025-12-10T19:18:32+00:00
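
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative, and the assumption that the scanner keeps a fixed schedule is inferred from this single pair of values, not stated by the report.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2025-12-03T19:18:32+00:00")
next_scan = datetime.fromisoformat("2025-12-10T19:18:32+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```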

Last Scan

Scanned 2025-12-03T19:18:32+00:00
URL https://maithili.org.in/robots.txt
Redirect https://www.maithili.org.in/robots.txt
Redirect Domain www.maithili.org.in
Redirect Base maithili.org.in
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2404:6800:4003:c00::79, 74.125.68.121
Response IP 74.125.200.121
Found Yes
Hash 5d2526cd2ba66a8510f9c09a2af2cac1f86b6b0a053b9ad40927692ae49d0f8c
SimHash 0d1492505f13
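
A content hash like the one above is typically used to detect whether the file changed between scans. The sketch below assumes SHA-256, based only on the 64-hex-digit length of the Hash value; the report does not name the algorithm or say exactly which bytes are hashed. The helper names `body_fingerprint` and `has_changed` are hypothetical.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a 64-character hex digest of the raw response body (SHA-256 assumed)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a freshly fetched body."""
    return body_fingerprint(body) != previous_hash.lower()


# Illustrative body only; this is not the actual file content.
sample = b"User-agent: *\nAllow: /\n"
stored = body_fingerprint(sample)
print(has_changed(stored, sample))
```

An exact hash flags any byte-level change, while a similarity hash such as the SimHash value is meant to show how close two versions are.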

Groups

mediapartners-google

Rule Path
Disallow (empty path; nothing is disallowed)

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap https://www.maithili.org.in/sitemap.xml
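
The groups and sitemap record above can be checked against Python's standard `urllib.robotparser`. The robots.txt text below is a reconstruction from the parsed groups in this report, not the verbatim file, so casing, ordering, and comments are assumptions. `ExampleBot` is a hypothetical crawler name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records listed above (assumed layout).
ROBOTS_TXT = """
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /search
Disallow: /share-widget
Allow: /

Sitemap: https://www.maithili.org.in/sitemap.xml
""".strip()

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.maithili.org.in"

# A generic crawler falls under the "*" group.
generic_search = parser.can_fetch("ExampleBot", BASE + "/search/label/news")
generic_widget = parser.can_fetch("ExampleBot", BASE + "/share-widget")
generic_home = parser.can_fetch("ExampleBot", BASE + "/")

# The AdSense crawler has its own group with an empty Disallow.
adsense_search = parser.can_fetch("Mediapartners-Google", BASE + "/search/label/news")

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

print(generic_search, generic_widget, generic_home, adsense_search, sitemaps)
```

Python's parser applies the first matching rule in file order, while RFC 9309 specifies longest-match precedence. For these rules both approaches give the same answers.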