morelaw.com
robots.txt

Robots Exclusion Standard data for morelaw.com

Resource Scan

Scan Details

Site Domain morelaw.com
Base Domain morelaw.com
Scan Status Ok
Last Scan 2026-01-20T02:01:36+00:00
Next Scan 2026-01-27T02:01:36+00:00

Last Scan

Scanned 2026-01-20T02:01:36+00:00
URL https://morelaw.com/robots.txt
Domain IPs 162.243.57.61
Response IP 162.243.57.61
Found Yes
Hash 6710224a9e0860fc0ba35f3c075c25cf9234755d900c2e26f700438bba2805dd
SimHash 358c5a13e6f0
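
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between scans could be sketched as follows. The function names are hypothetical and the SimHash field is not reproduced here.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a stored digest."""
    return fingerprint(body) != previous_hash.lower()
```

A scanner could store the digest from one scan and call has_changed() on the next fetch to decide whether the rules need re-parsing.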

Groups

*

Rule Path
Allow /verdicts/
Allow /lawyers/
Allow /news/
Allow /reporters/
Allow /experts/
Allow /cases/
Disallow /admin/
Disallow /morelawadmin/
Disallow /app/api/
Disallow /app/action/
Disallow /*/search/results/
Disallow /*?offset=
Disallow /*?limit=
Disallow /*?back=
Disallow /*?from=
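
The rules for the * group mix plain prefixes with * wildcards, so it helps to see how a path is evaluated against them. The sketch below follows the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It is an illustration under that assumption, not the scanner's or any crawler's actual code, and the helper names are hypothetical.

```python
import re

# Rules copied from the "*" group in the report above.
RULES = [
    ("allow", "/verdicts/"), ("allow", "/lawyers/"), ("allow", "/news/"),
    ("allow", "/reporters/"), ("allow", "/experts/"), ("allow", "/cases/"),
    ("disallow", "/admin/"), ("disallow", "/morelawadmin/"),
    ("disallow", "/app/api/"), ("disallow", "/app/action/"),
    ("disallow", "/*/search/results/"),
    ("disallow", "/*?offset="), ("disallow", "/*?limit="),
    ("disallow", "/*?back="), ("disallow", "/*?from="),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored-at-start regex."""
    anchored = pattern.endswith("$")  # a trailing '$' pins the end of the URL
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under this sketch, is_allowed("/cases/123") is True, while is_allowed("/cases/search/results/") and is_allowed("/news/?offset=20") are False because the longer Disallow pattern outranks the shorter Allow prefix. Note that a pattern such as /*?offset= only matches when offset is the first query parameter, so "/news/?page=2&offset=20" is still allowed.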

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

gptbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

claudebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

amazonbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

anthropic-ai

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

google-extended

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

perplexitybot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
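
The groups above carry only a crawl-delay record and no path rules. A crawler that honors the field could look up its delay as sketched below with Python's standard urllib.robotparser. The FRAGMENT text is a reconstruction of a few groups from this report, not the site's actual file, and crawl_delay_for is a hypothetical helper. Crawl-delay is not part of RFC 9309, and some crawlers ignore it.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report for illustration; the real file may differ.
FRAGMENT = """\
User-agent: googlebot
Crawl-delay: 1

User-agent: bingbot
Crawl-delay: 2

User-agent: gptbot
Crawl-delay: 10
"""


def crawl_delay_for(agent: str, robots_txt: str = FRAGMENT, default: int = 0) -> int:
    """Return the crawl delay in seconds for an agent, or default if none is set."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    delay = parser.crawl_delay(agent)  # None when no matching group has a delay
    return default if delay is None else delay
```

With this fragment, crawl_delay_for("gptbot") returns 10 and an agent with no group of its own falls back to the default. A polite fetch loop would then sleep for that many seconds between requests.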

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /
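
The eight groups above each contain a single "Disallow /" rule, which blocks the named bot from the whole site. A minimal sketch of checking that with urllib.robotparser follows. The robots.txt text is regenerated from the bot names in this report rather than copied from the site, and build_robots and may_fetch are hypothetical helpers.

```python
from urllib.robotparser import RobotFileParser

# Bot names as listed in the report.
BLOCKED = [
    "bytespider", "petalbot", "sogou", "dataforseobot",
    "ahrefsbot", "semrushbot", "mj12bot", "dotbot",
]


def build_robots(blocked=BLOCKED) -> str:
    """Emit one 'Disallow: /' group per blocked bot, separated by blank lines."""
    return "\n".join(f"User-agent: {name}\nDisallow: /\n" for name in blocked)


def may_fetch(agent: str, url: str, robots_txt: str = "") -> bool:
    """Return True if the agent may fetch the URL under the given rules."""
    parser = RobotFileParser()
    parser.parse((robots_txt or build_robots()).splitlines())
    return parser.can_fetch(agent, url)
```

Here may_fetch("ahrefsbot", "https://morelaw.com/cases/") is False, while an agent that is not in the list is allowed, because this regenerated fragment has no * group.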

Other Records

Field Value
sitemap https://www.morelaw.com/sitemap-index.xml
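
The sitemap record applies to the file as a whole rather than to a single group. As a sketch, it can be read with RobotFileParser.site_maps(), which is available in Python 3.8 and later. The list_sitemaps helper is hypothetical. Note that the sitemap URL uses the www.morelaw.com host, while the scan fetched robots.txt from morelaw.com.

```python
from urllib.robotparser import RobotFileParser


def list_sitemaps(robots_txt: str) -> list:
    """Return the Sitemap URLs declared in a robots.txt body (empty if none)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.site_maps() or []  # site_maps() returns None when absent


print(list_sitemaps("Sitemap: https://www.morelaw.com/sitemap-index.xml\n"))
```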

Comments

  • MoreLaw.com Robots.txt
  • Updated for SEO optimization
  • Allow crawling of all main content
  • Block admin and internal areas
  • Block search result pages (use canonical URLs instead)
  • Block session/tracking URLs
  • Sitemap location
  • Crawl rate guidelines for specific bots
  • AI Bot Rate Limiting - allow crawling but slow them down
  • Block aggressive/low-value bots
  • Block known bad bots