mesharma.com
robots.txt

Robots Exclusion Standard data for mesharma.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	mesharma.com
Base Domain	mesharma.com
Scan Status	Ok
Last Scan	2025-11-20T15:58:54+00:00
Next Scan	2025-12-20T15:58:54+00:00

Last Scan

Scanned	2025-11-20T15:58:54+00:00
URL	https://www.mesharma.com/robots.txt
Domain IPs	104.18.132.62, 104.18.133.62, 104.18.134.62, 104.18.135.62, 104.18.136.62
Response IP	104.18.134.62
Found	Yes
Hash	3ebbaba7d91c6d55fbb4988cbd20401e36a833271c556aef83f884211f88c82a
SimHash	411c894088b2

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

/

nerdybot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	60

Field

Value

crawl-delay

60

bubing

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.mesharma.com/sitemap.xml

Field

Value

sitemap

https://www.mesharma.com/sitemap.xml

Back to top

Comments

Cloudflare crawl control rules

Back to top

Warnings

`content-signal` is not a known field.

Back to top

mesharma.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

nerdybot

semrushbot

Other Records

bubing

Other Records

Comments

Warnings

mesharma.com
robots.txt