sprunk.io
robots.txt

Robots Exclusion Standard data for sprunk.io

Resource Scan

Scan Details

Site Domain sprunk.io
Base Domain sprunk.io
Scan Status Ok
Last Scan 2025-11-08T01:43:41+00:00
Next Scan 2025-11-15T01:43:41+00:00

Last Scan

Scanned 2025-11-08T01:43:41+00:00
URL https://sprunk.io/robots.txt
Domain IPs 104.21.75.21, 172.67.210.61, 2606:4700:3030::6815:4b15, 2606:4700:3036::ac43:d23d
Response IP 104.21.75.21
Found Yes
Hash 751e8038251975d56bd63618e42319aa70ef2136e7d950b04df19980b8c952c1
SimHash 10005e42c882

Groups

*

Rule Path
Allow /
Allow /en/
Allow /ko/
Allow /pt/
Allow /ru/
Disallow /Base-Template/
Disallow /css/
Disallow /js/
Disallow /img/
Disallow /node_modules/
Disallow /.git/
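
The group above mixes a blanket "Allow /" with narrower Disallow rules, so the outcome for a given path depends on which rule takes precedence. The sketch below assumes the longest-match convention described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). It handles plain path prefixes only, with no "*" or "$" wildcards. The function name is_allowed and the sample paths are hypothetical, and individual crawlers may resolve conflicts differently.

```python
# Rules copied from the "*" group in the report above.
RULES = [
    ("allow", "/"),
    ("allow", "/en/"),
    ("allow", "/ko/"),
    ("allow", "/pt/"),
    ("allow", "/ru/"),
    ("disallow", "/Base-Template/"),
    ("disallow", "/css/"),
    ("disallow", "/js/"),
    ("disallow", "/img/"),
    ("disallow", "/node_modules/"),
    ("disallow", "/.git/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: rules are plain prefixes (no wildcard support).
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


# Hypothetical sample paths, chosen only to exercise the rules.
for sample in ["/", "/en/index.html", "/css/main.css", "/.git/config"]:
    print(sample, "allowed" if is_allowed(sample) else "disallowed")
```

Under this convention "/css/main.css" is disallowed even though "Allow /" also matches it, because "/css/" is the longer prefix. A parser that applies rules in file order (first match wins) would instead allow everything here if "Allow /" is listed first.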

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://sprunk.io/sitemap.xml
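
As a rough illustration of how a client might read the crawl-delay and sitemap records, the sketch below uses Python's standard urllib.robotparser. The ROBOTS_TXT string is a hypothetical reconstruction from the fields in this report, not the file's actual text, and it omits the Allow/Disallow rules for brevity.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction: the report lists the parsed fields,
# not the exact contents or line order of the real file.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
Sitemap: https://sprunk.io/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# crawl_delay() returns the delay in seconds for the given user agent,
# or None when the group has no Crawl-delay record.
delay = parser.crawl_delay("*")

# site_maps() (Python 3.8+) returns the listed sitemap URLs, or None.
sitemaps = parser.site_maps()

print("crawl delay:", delay)
print("sitemaps:", sitemaps)
```

Crawl-delay is a non-standard extension that some crawlers ignore, so the value is best treated as a request to wait about 10 seconds between fetches rather than a guarantee of crawler behavior.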

Comments

  • robots.txt for https://sprunk.io
  • Sitemap
  • Crawler restrictions
  • Directories blocked from access