cloudlion.me
robots.txt

Robots Exclusion Standard data for cloudlion.me

Resource Scan

Scan Details

Site Domain cloudlion.me
Base Domain cloudlion.me
Scan Status Ok
Last Scan2025-03-20T03:07:54+00:00
Next Scan 2025-04-19T03:07:54+00:00

Last Scan

Scanned2025-03-20T03:07:54+00:00
URL https://www.cloudlion.me/robots.txt
Domain IPs 104.21.48.76, 172.67.181.186, 2606:4700:3035::6815:304c, 2606:4700:3035::ac43:b5ba
Response IP 104.21.48.76
Found Yes
Hash e088a3802d0fb0b7b8f9b81ece0226474dc23e4c934ce7648e99fb23552a8bc4
SimHash 7d5c9a60a5f2

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Disallow /app/

bingbot

Rule Path
Disallow /app/

Other Records

Field Value
sitemap https://my.cloudlion.me/sitemap.xml

Comments

  • 通用规则:默认禁止所有爬虫访问整个网站
  • 特殊规则:å
  • 特殊规则:å
  • 提供站点地图路径

Warnings

  • 4 invalid lines.