lhs11.com
robots.txt

Robots Exclusion Standard data for lhs11.com

Resource Scan

Scan Details

Site Domain lhs11.com
Base Domain lhs11.com
Scan Status Ok
Last Scan2025-10-23T02:25:41+00:00
Next Scan 2025-10-30T02:25:41+00:00

Last Scan

Scanned2025-10-23T02:25:41+00:00
URL https://lhs11.com/robots.txt
Domain IPs 47.115.190.222
Response IP 47.115.190.222
Found Yes
Hash 4871c48e34c0fe6611b628b1fa59555a8b53b775940536d6ab6b0d4ca4a1fe5e
SimHash 0414cc5366b4

Groups

*

Rule Path
Allow /
Disallow /news/%5Cd%2B-.*
Disallow /changlong-travel-guide/%5Cd%2B
Disallow /travel-news/%5Cd%2B
Disallow /about/%5Cd%2B
Disallow /culture/%5Cd%2B
Disallow /business/%5Cd%2B
Disallow /contact/%5Cd%2B

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

baiduspider

Rule Path
Allow /
Disallow /admin/
Disallow /private/
Disallow /tmp/
Disallow /backup/
Disallow /*.log$
Disallow /*.sql$
Disallow /*.bak$

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.lhs11.com/sitemap.xml

Comments

  • 避å
  • Sitemap
  • Crawl-delay for all bots
  • Specific directives for major search engines
  • Disallow certain directories/files
  • Disallow certain file types
  • Host directive

Warnings

  • 2 invalid lines.
  • `host` is not a known field.