upcounsel.com
robots.txt

Robots Exclusion Standard data for upcounsel.com

Resource Scan

Scan Details

Site Domain upcounsel.com
Base Domain upcounsel.com
Scan Status Ok
Last Scan2024-11-09T16:43:28+00:00
Next Scan 2024-11-16T16:43:28+00:00

Last Scan

Scanned2024-11-09T16:43:28+00:00
URL https://upcounsel.com/robots.txt
Domain IPs 104.21.24.246, 172.67.221.58, 2606:4700:3034::ac43:dd3a, 2606:4700:3036::6815:18f6
Response IP 172.67.221.58
Found Yes
Hash 0a2fd42daf303a5ad53c80dec8eb7128761a3853604dd910f89e9aeafa513ce6
SimHash 354195d34f72

Groups

easouspider

Rule Path
Disallow /

*

Rule Path
Disallow /account/activate
Disallow /account/edit
Disallow /account/noop/
Disallow /account/add_credit_card/
Disallow /account/add_bank_account/
Disallow /checkout/
Disallow /pending
Disallow /suspended
Disallow /invoice/
Disallow /invoices/
Disallow /radmin/
Disallow /ajax/
Disallow /404.html
Disallow /410.html
Disallow /422.html
Disallow /500.html
Disallow /503.html
Disallow /search?q=*
Disallow /jobs/
Disallow /auth/
Disallow /assets/ignored-*.js
Disallow /logger/r
Disallow /events/cl

Other Records

Field Value
sitemap https://www.upcounsel.com/sitemap.xml.gz

Comments

  • robots.txt
  • Disallow: */2$
  • Disallow: */3$
  • Disallow: */4$
  • Disallow: */5$
  • Disallow: */6$
  • Disallow: */7$
  • Disallow: */8$
  • Disallow: */9$
  • Disallow: */all$