leaplaw.com
robots.txt

Robots Exclusion Standard data for leaplaw.com

Resource Scan

Scan Details

Site Domain leaplaw.com
Base Domain leaplaw.com
Scan Status Ok
Last Scan2025-05-08T20:24:59+00:00
Next Scan 2025-06-07T20:24:59+00:00

Last Scan

Scanned2025-05-08T20:24:59+00:00
URL https://leaplaw.com/robots.txt
Domain IPs 104.21.25.168, 172.67.134.102, 2606:4700:3034::6815:19a8, 2606:4700:3037::ac43:8666
Response IP 172.67.134.102
Found Yes
Hash ca373333d6d62074e20947cd2d6826c6e60cb1b146c1d490be281eb4a9986c8e
SimHash 24c38b008955

Groups

atn_worldwide

Rule Path
Disallow

scooter

Rule Path
Disallow

architextspider

Rule Path
Disallow

fast-webcrawler

Rule Path
Disallow

googlebot

Rule Path
Disallow

slurp

Rule Path
Disallow

lycos_spider_(t-rex)

Rule Path
Disallow

daviesbot

Rule Path
Disallow

zyborg

Rule Path
Disallow

zeus

Rule Path
Disallow

winona

Rule Path
Disallow

*

Rule Path
Disallow /pubSearch/preview/

Comments

  • Robot-Manager was used to generate this file.
  • Copyright (c) 2001-2002 by Sophtware.com, Inc. All Rights Reserved.
  • http://www.websitemanagementtools.com/
  • FULL access (AllThatNet)
  • FULL access (Alta Vista)
  • FULL access (Excite)
  • FULL access (FAST/AllTheWeb)
  • FULL access (Google)
  • FULL access (Inktomi)
  • FULL access (Lycos)
  • FULL access (WholeWeb)
  • FULL access (WiseNut)
  • FULL access (Zeus)
  • FULL access (whatUseek)
  • FULL access (All Spiders)