techyeagle.com
robots.txt

Robots Exclusion Standard data for techyeagle.com

Resource Scan

Scan Details

Site Domain techyeagle.com
Base Domain techyeagle.com
Scan Status Ok
Last Scan2025-08-08T17:54:41+00:00
Next Scan 2025-09-07T17:54:41+00:00

Last Scan

Scanned2025-08-08T17:54:41+00:00
URL https://techyeagle.com/robots.txt
Domain IPs 104.21.54.171, 172.67.140.176, 2606:4700:3033::ac43:8cb0, 2606:4700:3034::6815:36ab
Response IP 104.21.54.171
Found Yes
Hash f6afb471832b8473b07d1a92b8b319531cd0661829f1d1ae49a6806c351affd4
SimHash 7f34d9f62731

Groups

*

Rule Path
Allow /
Allow /sitemap.xml
Disallow /admin/
Disallow /cache/
Disallow /config/
Disallow /logs/
Disallow /*.log
Disallow /*.txt$
Disallow /*.php$

Other Records

Field Value
crawl-delay 1

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

sogou web spider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap http://example.com/sitemap.xml

Comments

  • 优化爬虫访问
  • 禁止访问管理目录
  • 禁止访问敏感文件
  • Sitemap位置
  • 针对特定搜索引擎的优化

Warnings

  • 3 invalid lines.