dakahliya.com
robots.txt

Robots Exclusion Standard data for dakahliya.com

Resource Scan

Scan Details

Site Domain dakahliya.com
Base Domain dakahliya.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-05-06T03:42:34+00:00
Next Scan 2025-05-07T03:42:34+00:00

Last Successful Scan

Scanned2025-04-29T03:42:22+00:00
URL https://dakahliya.com/robots.txt
Domain IPs 65.108.104.232, 65.109.39.175
Response IP 65.109.39.175
Found Yes
Hash 2014237e4ef73a8415f4636b8330461284ebad0bc23728fdc7f30f7c19151009
SimHash 4110cba06283

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /trackback/
Disallow /*?sort=
Disallow /*xmlrpc
Disallow /*?s=
Disallow /*?

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

yandexbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://dakahliya.com/news-sitemap.xml
sitemap https://dakahliya.com/sitemap_index.xml