sumyinfo.com
robots.txt

Robots Exclusion Standard data for sumyinfo.com

Resource Scan

Scan Details

Site Domain sumyinfo.com
Base Domain sumyinfo.com
Scan Status Ok
Last Scan 2026-01-07T18:49:01+00:00
Next Scan 2026-02-06T18:49:01+00:00
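
The two timestamps above imply a fixed rescan interval. A minimal sketch of that arithmetic, assuming the scanner simply adds a constant offset to the last scan time (the report does not state its scheduling rule):

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details above.
last_scan = datetime.fromisoformat("2026-01-07T18:49:01+00:00")
next_scan = datetime.fromisoformat("2026-02-06T18:49:01+00:00")

# Difference between the two scans; both values carry a UTC offset,
# so the subtraction is timezone-aware.
interval = next_scan - last_scan
print(interval.days)
```

For these values the difference is exactly 30 days.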

Last Scan

Scanned 2026-01-07T18:49:01+00:00
URL https://sumyinfo.com/robots.txt
Domain IPs 104.21.34.167, 172.67.163.73, 2606:4700:3031::ac43:a349, 2606:4700:3035::6815:22a7
Response IP 104.21.34.167
Found Yes
Hash 90f409c001d6326c917f479c4811c7a2ddfebf13b958eb6f1fa0a819193a0669
SimHash 6b094a72ce11
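
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so the sketch below is an assumption; `body_digest` is a hypothetical helper, and the sample bytes are not the real file contents.

```python
import hashlib

def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()

# Illustrative input only; the real digest depends on the exact bytes served.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
print(body_digest(sample))
```

Comparing a fresh digest with the stored one is a cheap way to detect that the file changed between scans, provided the same algorithm and byte-for-byte input are used.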

Groups

seznambot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

msnbot-media

Rule Path
Disallow /

teleport

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

mail.ru

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 5 sets a timeout of 2 seconds

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /trackback
Disallow */trackback
Disallow */*/trackback
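
The last two trackback rules rely on the `*` wildcard. A minimal sketch of how such patterns can be matched, assuming the crawler follows the common convention where `*` matches any run of characters and a trailing `$` anchors the end of the path; `pattern_to_regex` and `is_disallowed` are hypothetical helper names, not part of any crawler's API.

```python
import re

# The three trackback rules from the `*` group above.
TRACKBACK_RULES = ["/trackback", "*/trackback", "*/*/trackback"]

def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """True if any pattern matches the path from its first character."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

print(is_disallowed("/news/item/trackback", TRACKBACK_RULES))
print(is_disallowed("/news/item/", TRACKBACK_RULES))
```

Under this convention `*/trackback` already covers every path that `*/*/trackback` matches, so the third rule is redundant for wildcard-aware crawlers.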

Other Records

Field Value
sitemap https://sumyinfo.com/sitemap.xml
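
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds an approximate robots.txt from the report (an assumption: the original file's exact text, ordering, and casing are not shown here) and queries it. The wildcard trackback rules are omitted because the standard-library parser does plain prefix matching, and `examplebot` is a made-up agent name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Agents whose group contains only "Disallow: /" in the report.
BLOCKED_AGENTS = [
    "seznambot", "petalbot", "trendictionbot", "ahrefsbot", "exabot",
    "crazywebcrawler-spider", "semrushbot", "msnbot", "msnbot-media",
    "teleport", "telesoft", "webcopier",
]
# Literal (non-wildcard) paths from the `*` group.
DEFAULT_DISALLOWS = [
    "/wp-admin/", "/wp-includes/", "/wp-content/plugins/",
    "/wp-content/cache/", "/wp-content/themes/", "/trackback",
]

def build_robots_txt() -> str:
    """Reconstruct an approximation of the scanned file from the report."""
    groups = [f"User-agent: {agent}\nDisallow: /" for agent in BLOCKED_AGENTS]
    groups.append("User-agent: mail.ru\nCrawl-delay: 5")
    rules = "\n".join(f"Disallow: {path}" for path in DEFAULT_DISALLOWS)
    groups.append(f"User-agent: *\n{rules}\nSitemap: https://sumyinfo.com/sitemap.xml")
    return "\n\n".join(groups) + "\n"

parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

base = "https://sumyinfo.com"
print(parser.can_fetch("ahrefsbot", base + "/"))            # blocked everywhere
print(parser.can_fetch("examplebot", base + "/wp-admin/"))  # falls under `*`
print(parser.can_fetch("mail.ru", base + "/wp-admin/"))     # own group, no rules
print(parser.crawl_delay("mail.ru"))
print(parser.site_maps())  # requires Python 3.8 or newer
```

Because `mail.ru` has its own group, it does not inherit the `*` group's Disallow rules, which matches the report's "All paths allowed" note.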

Warnings

  • `host` is not a known field.