involute.top
robots.txt

Robots Exclusion Standard data for involute.top

Resource Scan

Scan Details

Site Domain involute.top
Base Domain involute.top
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-10-31T03:02:17+00:00
Next Scan 2025-01-29T03:02:17+00:00

Last Successful Scan

Scanned2023-12-14T02:17:06+00:00
URL https://www.involute.top/robots.txt
Domain IPs 13.215.144.61, 18.139.194.139, 2406:da18:b3d:e200::64, 2406:da18:b3d:e201::64
Response IP 13.251.96.10
Found Yes
Hash 0df1e5d50eea3c879a535642361b848245004eaa1ad49eabfff44dbb9f22ce20
SimHash 58489cc02113

Groups

*

Rule Path
Disallow /images/
Disallow /js/
Disallow /css/

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

turnitin robot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

Other Records

Field Value
sitemap /sitemap.xml

Comments

  • Block SISTRIX
  • Block Uptime robot
  • Block Ezooms Robot
  • Block Perl LWP
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • Block WiseGuys Robot
  • Block Turnitin Robot
  • Block Heritrix
  • Block pricepi

Warnings

  • 4 invalid lines.