arab-dream.net
robots.txt

Robots Exclusion Standard data for arab-dream.net

Resource Scan

Scan Details

Site Domain arab-dream.net
Base Domain arab-dream.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan3/21/2025, 3:44:35 AM
Next Scan 4/20/2025, 3:44:35 AM

Last Successful Scan

Scanned1/1/2025, 3:40:51 AM
URL https://arab-dream.net/robots.txt
Redirect https://arab-dream.news/robots.txt
Redirect Domain arab-dream.news
Redirect Base arab-dream.news
Domain IPs 104.21.59.31, 172.67.211.238, 2606:4700:3030::6815:3b1f, 2606:4700:3036::ac43:d3ee
Redirect IPs 104.21.89.247, 172.67.166.46, 2606:4700:3031::ac43:a62e, 2606:4700:3034::6815:59f7
Response IP 104.21.89.247
Found Yes
Hash 5b94805fa6b629abf48deef4008c3ef3eaa791c515a7206b0c8d29d08501cfbd
SimHash 456c57d496f7

Groups

*

Rule Path
Allow /
Allow /uploads/*
Disallow /admin/
Disallow /templates/
Disallow /include/
Disallow /lang/
Disallow /Smarty/
Disallow /404.php
Disallow /config.php
Disallow /cdn-cgi/
Disallow *sortby%3D*
Disallow *order%3D*
Disallow *page%3D*
Disallow *%26*
Disallow /*.js$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /?*
Disallow /*.php$
Disallow /new/?*

ia_archiver-web.archive.org
autogpt
agent gpt
anthropic-ai
aria browser ai
aria browse aria ai
aisearchbot
applebot-extended
amazon-kendra
amazon silk
amazon sagemaker
aws trainium
amazon bedrock
alexa
alexatm
amazon textract
amazon comprehend
anypicker
alphaai
ai2 bot
ai2bot-dolma
brave leo ai
bingbot-chat/2.0
bing ai
cohere-ai
claudebot
crewai
claude 3.5 sonnet
claude 3.5 haiku
ccbot
ccbot
ccbot/2.0
cc-crawler/2.0
chatgpt
chatgpt-user
dataprovider
diffbot
dialogpt
depolarizinggpt
doubao ai
duckassistbot
dalvik/2
dalvik/2.1.0
gptbot
gptbot/0.1
gptbot/1.0
gptbot/1.2
gpt-1
gpt-2
gpt-3
gpt-3.5
gpt-4
gpt-4-turbo
gpt-4o
gpt-4v
gpt-4o mini
gpt-3.5 turbo
gpt 4 omni
gpt 4 omni mini
gpt-sw3
gptzero
zerogpt
zerochat
searchgpt
google-extended
google gemini
googleother
google-cloudvertexbot
facebookbot
facebookbot/1.0
facebookexternalhit
facebookexternalhit/1.1
meta-externalfetcher/1.1
meta-externalagent
meta-externalfetcher
meta ai
llama 3.2
friendlycrawler
friendlycrawler/1.0
friendlycrawler/nutch-1.20-snapshot
icc-crawler
intelliseek
intelliseek.ai
imagesiftbot
img2dataset
iaskspider/2.0
kangaroo bot
leftwinggpt
nicecrawler
magpie-crawler
omgilibot
omgili
openai
openbot
openai gpt
oai searchbot
oai-searchbot/1.0
openai o1
openai o1-mini
owler
searchgpt
scrapy
stability ai
scrapergpt
shadowgpt
thehive.ai
rightwinggpt
timpibot
timpibot/0.8
timpibot/0.9
webchatgpt
wormgpt v3.0
wpbot
wpbot/1.1
webzio-extended
velenpublicwebcrawler
paperlibot
paperlibot/2.1
perplexitybot
proximic
peer39_crawler/1.0
piplbot
youbot
yargpt
yarchatgpt
duggmirror

Rule Path
Disallow /

Other Records

Field Value
sitemap https://arab-dream.news/sitemap-index.xml