Robots Exclusion Standard data for mawarijo.com

Resource Scan

Scan Details

Site Domain mawarijo.com
Base Domain mawarijo.com
Scan Status Ok
Last Scan 2026-01-24T13:38:12+00:00
Next Scan 2026-02-23T13:38:12+00:00
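
The two timestamps above imply a rescan interval, which the report does not state. A minimal sketch that derives it from the listed values; the variable names are illustrative only:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2026-01-24T13:38:12+00:00")
next_scan = datetime.fromisoformat("2026-02-23T13:38:12+00:00")

# The difference between the two is the scheduled rescan interval.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the difference is exactly 30 days. Whether the scanner always uses a fixed 30-day interval is an assumption the report does not confirm.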

Last Scan

Scanned 2026-01-24T13:38:12+00:00
URL https://mawarijo.com/robots.txt
Domain IPs 104.21.83.143, 172.67.177.111, 2606:4700:3030::6815:538f, 2606:4700:3030::ac43:b16f
Response IP 104.21.83.143
Found Yes
Hash 6730db89426d0aab88213f330a1922b3e836bdbe06f9f0819f247e048e025f91
SimHash 695589137393
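
The report does not say how the Hash field is computed. Its 64 hexadecimal characters are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body. That is an assumption, and `body_digest` and `matches_reported` are hypothetical helper names, not part of any scanner API. The SimHash field is not reproduced here because its algorithm and parameters are unknown.

```python
import hashlib

# Hash value copied from the Last Scan fields above.
REPORTED_HASH = "6730db89426d0aab88213f330a1922b3e836bdbe06f9f0819f247e048e025f91"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """Check a fetched robots.txt body against the reported hash."""
    return body_digest(body) == REPORTED_HASH
```

If a freshly fetched body does not match, either the file has changed since the scan or the assumed algorithm or input normalization differs from what the scanner uses.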

Groups

googlebot

Rule Path
Disallow
Allow /pub-b8b0685036a243eab306f1d464f9c293.r2.dev/amp-mawarijo.html/

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

*

Rule Path
Disallow
Disallow /cgi-bin/

Other Records

Field Value
sitemap https://mawarijo.com/sitemap.xml
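
The groups listed above can be read as follows: a Disallow rule with an empty path blocks nothing, so the three Googlebot groups permit everything, and only the `*` group blocks `/cgi-bin/`. The sketch below encodes the listed rules and evaluates them with plain prefix matching, preferring the longest matching path and Allow on ties. It is a simplified illustration, not a full robots.txt parser: wildcard (`*`) and end-anchor (`$`) patterns are not handled, user-agent lookup is an exact name match with fallback to `*`, and `GROUPS`, `is_allowed`, and the crawler name `examplebot` are illustrative names.

```python
from typing import Dict, List, Tuple

# Rules copied from the Groups section; an empty path matches nothing.
GROUPS: Dict[str, List[Tuple[str, str]]] = {
    "googlebot": [
        ("disallow", ""),
        ("allow", "/pub-b8b0685036a243eab306f1d464f9c293.r2.dev/amp-mawarijo.html/"),
    ],
    "googlebot-image": [("disallow", "")],
    "googlebot-mobile": [("disallow", "")],
    "*": [("disallow", ""), ("disallow", "/cgi-bin/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the path may be crawled by the given user agent."""
    # Use the named group if one exists, otherwise fall back to "*".
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means allowed
    for rule, prefix in rules:
        # Skip empty paths and rules whose prefix does not match.
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = rule == "allow"
        # Longest matching prefix wins; Allow wins when lengths are equal.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len, allowed = len(prefix), is_allow
    return allowed


print(is_allowed("googlebot", "/cgi-bin/test"))
print(is_allowed("examplebot", "/cgi-bin/test"))
print(is_allowed("examplebot", "/index.html"))
```

Under these assumptions, `googlebot` may fetch `/cgi-bin/test` because its own group contains no effective Disallow, while a crawler that falls back to the `*` group may not.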