news.halalkhalij.com
robots.txt

Robots Exclusion Standard data for news.halalkhalij.com

Resource Scan

Scan Details

Site Domain news.halalkhalij.com
Base Domain halalkhalij.com
Scan Status Ok
Last Scan2024-11-03T07:29:01+00:00
Next Scan 2024-12-03T07:29:01+00:00

Last Scan

Scanned2024-11-03T07:29:01+00:00
URL https://news.halalkhalij.com/robots.txt
Domain IPs 104.21.87.152, 172.67.144.30, 2606:4700:3031::ac43:901e, 2606:4700:3034::6815:5798
Response IP 172.67.144.30
Found Yes
Hash ebe7c685ba053eba83138dea5e42b8b86deee7cc747aa3c612e94a08755738f0
SimHash eb199801ef12

Groups

*

Rule Path
Disallow /panel
Disallow /cron
Disallow /ajax
Disallow /widgets_factory
Disallow /auth
Disallow /login
Disallow /register
Disallow /style
Disallow /printit
Disallow /emailthis
Disallow /outside

Other Records

Field Value
sitemap https://halalkhalij.com/sitemap.xml
sitemap https://halalkhalij.com/sitemap.xml?format=google_news