madrasa.guide
robots.txt

Robots Exclusion Standard data for madrasa.guide

Resource Scan

Scan Details

Site Domain madrasa.guide
Base Domain madrasa.guide
Scan Status Ok
Last Scan2025-11-28T20:28:48+00:00
Next Scan 2025-12-05T20:28:48+00:00

Last Scan

Scanned2025-11-28T20:28:48+00:00
URL https://madrasa.guide/robots.txt
Redirect https://www.madrasa.guide/robots.txt
Redirect Domain www.madrasa.guide
Redirect Base madrasa.guide
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 142.250.4.121, 2404:6800:4003:c01::79
Response IP 74.125.24.121
Found Yes
Hash 07ff81eb714edd2b633a5bfa50731e274365e701d7b0c5458caa74e4acb334a4
SimHash 6d0494405490

Groups

mediapartners-google

Rule Path
Disallow

google-display-ads-bot

Rule Path
Disallow

*

Rule Path
Allow /
Disallow /search
Allow /search/label/

Other Records

Field Value
sitemap https://www.madrasa.guide/sitemap.xml
sitemap https://www.madrasa.guide/sitemap-pages.xml