guapizhu.com
robots.txt

Robots Exclusion Standard data for guapizhu.com

Resource Scan

Scan Details

Site Domain guapizhu.com
Base Domain guapizhu.com
Scan Status Ok
Last Scan 2025-12-07T00:55:21+00:00
Next Scan 2025-12-14T00:55:21+00:00

Last Scan

Scanned 2025-12-07T00:55:21+00:00
URL https://guapizhu.com/robots.txt
Redirect https://www.guapizhu.com/robots.txt
Redirect Domain www.guapizhu.com
Redirect Base guapizhu.com
Domain IPs 39.106.13.171
Redirect IPs 39.106.13.171
Response IP 39.106.13.171
Found Yes
Hash a9f507421eb1683e3aa8b5f9e4a1ced4455cc6cd0e0ab581f71d691e740f4dcf
SimHash 6e3c5870ecd3

Groups

*

Rule Path Comment
Disallow /wp-admin/ Block crawling of the admin area
Disallow /wp-includes/ Block crawling of core files
Disallow /feed/ Block the RSS feed
Disallow /trackback/ Block trackbacks (optional)
Allow /

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 8

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 10 Crawl once every 10 seconds (unit: seconds)

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.guapizhu.com/wp-sitemap.xml

Comments

  • Control crawl rate (supported by some crawlers)
  • Sitemap address

Warnings

  • 3 invalid lines.