i4.cn
robots.txt

Robots Exclusion Standard data for i4.cn

Resource Scan

Scan Details

Site Domain i4.cn
Base Domain i4.cn
Scan Status Ok
Last Scan2024-10-25T18:44:40+00:00
Next Scan 2024-11-24T18:44:40+00:00

Last Scan

Scanned2024-10-25T18:44:40+00:00
URL https://www.i4.cn/robots.txt
Domain IPs 138.113.37.27
Response IP 138.113.37.27
Found Yes
Hash b3a2e4a2b08a08b815207705bd9a549ff67f4a70b8a82d3dc6c7f708876e2d53
SimHash ec2c6672df3e

Groups

*

Rule Path
Allow /index_search.action?*
Disallow /wper_detail_*.html
Disallow /*?*
Disallow /pros/
Disallow /article/
Disallow /news/
Disallow /jiaocheng/
Disallow /install/
Disallow /wallpapers/
Disallow /rings/
Disallow /mobile/
Disallow /m/
Disallow /wenhuajiaoyu/
Disallow /jiaoxuekejian/
Disallow /zonghekejian/
Disallow /shitishijuan/
Disallow /m_ios_news_detail_*.html

Other Records

Field Value
sitemap https://www.i4.cn/sitemap/https/sitemap.xml

Comments

  • -----------------------------------------------------------------------------
  • 禁止爬虫爬取无效URL,提升网站核心静态资源抓取及索引效率。
  • 无效URLåŒ
  • 等各种无需被SE收录的URL。
  • -----------------------------------------------------------------------------

Warnings

  • 1 invalid line.
  • `含` is not a known field.