armaghankashan.com
robots.txt

Robots Exclusion Standard data for armaghankashan.com

Resource Scan

Scan Details

Site Domain armaghankashan.com
Base Domain armaghankashan.com
Scan Status Ok
Last Scan 2025-08-13T15:16:53+00:00
Next Scan 2025-09-12T15:16:53+00:00

Last Scan

Scanned 2025-08-13T15:16:53+00:00
URL https://armaghankashan.com/robots.txt
Domain IPs 185.143.233.120, 185.143.234.120
Response IP 185.143.234.120
Found Yes
Hash 461e233277074089998aa3c037e457b69252238977361007c6ae5e3d5f31dc49
SimHash 65d97843e561

Groups

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow

yandexbot

Rule Path
Disallow

sogou spider

Rule Path
Disallow

exabot

Rule Path
Disallow

facebot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

openai

Rule Path
Disallow

chatgptbot

Rule Path
Disallow

anthropicbot

Rule Path
Disallow

*

Rule Path
Disallow /cpanel/
Disallow /login/
Disallow /upanel/
Allow /hupload/
Allow /hlazy/
Allow /hltheme/
Allow /hstheme/
Allow /htemplate/
Allow /stemplate/
Allow /hdownload/
Allow /heditor/
Allow /hcolleagues/
Allow /hads/
Allow /himage/
Allow /hvideo/
Allow /hsong/
Allow /huser/
Allow /hluser/
Allow /hlimage/
Allow /hlshop/
Allow /hshop/
Allow /harticle/
Allow /handroid/
Allow /hgoogle/
Allow /sitemap.xml
Allow /hbanner/
Allow /hicon/
Allow /hsample/
Allow /hexcel/
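
To see how a crawler might evaluate the groups listed above, here is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT string is a shortened reconstruction from this scan, not the raw file served by the site, and "examplebot" is a hypothetical agent name used to exercise the * group. An empty Disallow is treated as "nothing is disallowed", so the named bots are not restricted, while other agents are kept out of /cpanel/, /login/ and /upanel/.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a shortened reconstruction based on the scan data above,
# not the raw robots.txt as served by the site.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: *
Disallow: /cpanel/
Disallow: /login/
Disallow: /upanel/
Allow: /hupload/
Allow: /sitemap.xml
"""

BASE = "https://armaghankashan.com"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` on the scanned site."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    parser = build_parser()
    # "examplebot" is a hypothetical agent that only matches the * group.
    checks = [
        ("googlebot", "/cpanel/"),
        ("examplebot", "/cpanel/"),
        ("examplebot", "/login/"),
        ("examplebot", "/hupload/a.png"),
        ("examplebot", "/"),
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(parser, agent, path))
```

Note that urllib.robotparser applies the first matching rule in file order; other crawlers may use longest-match precedence instead, so results for overlapping paths can differ between implementations.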

Other Records

Field Value
sitemap /hgoogle
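
The sitemap value recorded above is a path rather than a full URL, whereas the sitemaps protocol expects a complete URL. As a rough sketch, a consumer could resolve the value against the scanned robots.txt URL. Whether any particular crawler does this is an assumption, and resolve_sitemap is a hypothetical helper name.

```python
from urllib.parse import urljoin

# URL of the scanned file, as reported in the scan details.
ROBOTS_URL = "https://armaghankashan.com/robots.txt"


def resolve_sitemap(value: str, robots_url: str = ROBOTS_URL) -> str:
    """Resolve a sitemap field value against the robots.txt URL.

    Absolute URLs pass through unchanged; paths are joined to the host.
    """
    return urljoin(robots_url, value)


if __name__ == "__main__":
    print(resolve_sitemap("/hgoogle"))
```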