icqia.com
robots.txt

Robots Exclusion Standard data for icqia.com

Resource Scan

Scan Details

Site Domain icqia.com
Base Domain icqia.com
Scan Status Ok
Last Scan 2026-02-03T15:07:32+00:00
Next Scan 2026-03-05T15:07:32+00:00

Last Scan

Scanned 2026-02-03T15:07:32+00:00
URL https://icqia.com/robots.txt
Domain IPs 104.21.26.140, 172.67.136.110, 2606:4700:3031::6815:1a8c, 2606:4700:3036::ac43:886e
Response IP 172.67.136.110
Found Yes
Hash d7144df23717e3ea48e34246cbed77271d9ad85c495aa402bd3a8e93eb24f246
SimHash 480cfa077f92

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /*.json$

baiduspider

Rule Path
Allow /
Disallow /admin/
Disallow /api/

googlebot

Rule Path
Allow /
Disallow /admin/
Disallow /api/

sogou web spider

Rule Path
Allow /
Disallow /admin/
Disallow /api/
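
The rule tables above can be read with the longest-match convention described in RFC 9309: the matching rule with the longest path pattern wins, and a tie goes to Allow. The following is a minimal sketch of that evaluation for the `*` group, not the scanner's own logic. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and introduced only for illustration; individual crawlers may resolve conflicts differently.

```python
import re

# Rules copied from the "*" group in the scan above.
RULES = [
    ("allow", "/"),
    ("disallow", "/admin/"),
    ("disallow", "/api/"),
    ("disallow", "/*.json$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an allow rule.

    Assumption: pattern length is used as the measure of specificity,
    and a path that matches no rule is treated as allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under this reading, `is_allowed("/videos/1")` is true because only `Allow /` matches, while `is_allowed("/admin/users")` and `is_allowed("/data/feed.json")` are false because a longer Disallow pattern also matches.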

Other Records

Field Value
sitemap https://yourdomain.com/sitemap.xml

Comments

  • robots.txt for a curated video website
  • Baidu crawler
  • Google crawler
  • Sogou crawler
  • 360 Search crawler
  • Sitemap location (please replace with your actual domain)

Warnings

  • 4 invalid lines.