news.halalkhalij.com
robots.txt
Robots Exclusion Standard data for news.halalkhalij.com
Resource Scan
Scan Details
Site Domain | news.halalkhalij.com |
Base Domain | halalkhalij.com |
Scan Status | Ok |
Last Scan | 2024-11-03T07:29:01+00:00 |
Next Scan | 2024-12-03T07:29:01+00:00 |
Last Scan
Scanned | 2024-11-03T07:29:01+00:00 |
URL | https://news.halalkhalij.com/robots.txt |
Domain IPs | 104.21.87.152, 172.67.144.30, 2606:4700:3031::ac43:901e, 2606:4700:3034::6815:5798 |
Response IP | 172.67.144.30 |
Found | Yes |
Hash | ebe7c685ba053eba83138dea5e42b8b86deee7cc747aa3c612e94a08755738f0 |
SimHash | eb199801ef12 |
Groups
*
Rule | Path |
---|---|
Disallow | /panel |
Disallow | /cron |
Disallow | /ajax |
Disallow | /widgets_factory |
Disallow | /auth |
Disallow | /login |
Disallow | /register |
Disallow | /style |
Disallow | /printit |
Disallow | /emailthis |
Disallow | /outside |
Other Records
Field | Value |
---|---|
sitemap | https://halalkhalij.com/sitemap.xml |
sitemap | https://halalkhalij.com/sitemap.xml?format=google_news |