khanherbals.com
robots.txt

Robots Exclusion Standard data for khanherbals.com

Resource Scan

Scan Details

Site Domain	khanherbals.com
Base Domain	khanherbals.com
Scan Status	Ok
Last Scan	2026-01-30T06:51:41+00:00
Next Scan	2026-03-01T06:51:41+00:00

Last Scan

Scanned	2026-01-30T06:51:41+00:00
URL	https://khanherbals.com/robots.txt
Domain IPs	104.21.26.231, 172.67.168.147, 2606:4700:3030::ac43:a893, 2606:4700:3032::6815:1ae7
Response IP	104.21.26.231
Found	Yes
Hash	7af6e8adfdab5651829e18abff913a0ac5c7bf891d05947088608520c3956447
SimHash	2b148c12e3d0

Groups

*

Rule	Path
Allow	/
Disallow	/admin/
Disallow	/includes/
Disallow	/assets/fonts/
Disallow	/cache/
Disallow	/tmp/
Disallow	/uploads/

googlebot

Rule	Path
Allow	/

Other Records

Field	Value
crawl-delay	0

bingbot

Rule	Path
Allow	/

Other Records

Field	Value
crawl-delay	1

ahrefsbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/
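
The groups above can be read with the usual longest-match rule: a crawler uses the group that names it, falls back to `*` otherwise, and the longest matching path prefix decides the outcome. The following is a minimal sketch of that evaluation, assuming plain prefix rules only (the scan lists no wildcards) and exact matching on the user-agent token. The names `GROUPS`, `select_group`, `is_allowed`, and the crawler `examplebot` are hypothetical and used only for illustration.

```python
# Rule groups transcribed from the scan tables above.
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/admin/"),
        ("disallow", "/includes/"),
        ("disallow", "/assets/fonts/"),
        ("disallow", "/cache/"),
        ("disallow", "/tmp/"),
        ("disallow", "/uploads/"),
    ],
    "googlebot": [("allow", "/")],
    "bingbot": [("allow", "/")],
    "ahrefsbot": [("disallow", "/")],
    "semrushbot": [("disallow", "/")],
}


def select_group(user_agent: str):
    """Return the rules of the group naming this agent, else the '*' group.

    Simplification: exact, case-insensitive match on the product token.
    """
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching prefix wins; ties go to allow; no match means allowed."""
    best_len, allowed = -1, True
    for kind, prefix in select_group(user_agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len, allowed = len(prefix), kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("examplebot", "/admin/login"),  # hypothetical crawler, falls back to '*'
        ("examplebot", "/products/"),
        ("googlebot", "/admin/login"),
        ("ahrefsbot", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under this reading, a crawler with its own group does not inherit the `*` rules, so the `/admin/` and `/uploads/` exclusions apply only to crawlers that fall back to `*`. Parsers that apply the first matching rule instead of the longest one would treat the leading `Allow /` in the `*` group as overriding the later `Disallow` lines.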

Other Records

Field	Value
sitemap	http://khanherbals-modern.local/sitemap.xml.php
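
The sitemap record points at a host that differs from the scanned site domain. A minimal sketch of how that can be checked, assuming only the two values shown in this report; the helper name `sitemap_host_matches` is hypothetical.

```python
from urllib.parse import urlsplit

# Values copied from the scan details and the sitemap record above.
SITE_DOMAIN = "khanherbals.com"
SITEMAP_URL = "http://khanherbals-modern.local/sitemap.xml.php"


def sitemap_host_matches(site_domain: str, sitemap_url: str) -> bool:
    """True if the sitemap host is the site domain or one of its subdomains."""
    host = (urlsplit(sitemap_url).hostname or "").lower()
    site = site_domain.lower()
    return host == site or host.endswith("." + site)


if __name__ == "__main__":
    host = urlsplit(SITEMAP_URL).hostname
    print("sitemap host:", host)
    print("matches site domain:", sitemap_host_matches(SITE_DOMAIN, SITEMAP_URL))
    print("ends with .local:", host.endswith(".local"))
```

A `.local` hostname is normally resolvable only on a local network, so a crawler on the public internet is unlikely to be able to fetch a sitemap at that address.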

Comments

Khan Herbals - Robots.txt
This file helps search engines crawl and index the website efficiently
Allow all robots to crawl
Allow specific crawlers to index
Block bad bots
Sitemap location
