listjar.com
robots.txt

Robots Exclusion Standard data for listjar.com

Resource Scan

Scan Details

Site Domain listjar.com
Base Domain listjar.com
Scan Status Ok
Last Scan2024-11-15T02:36:54+00:00
Next Scan 2024-11-29T02:36:54+00:00

Last Scan

Scanned2024-11-15T02:36:54+00:00
URL https://listjar.com/robots.txt
Domain IPs 104.21.86.144, 172.67.220.148, 2606:4700:3032::6815:5690, 2606:4700:3033::ac43:dc94
Response IP 172.67.220.148
Found Yes
Hash a20e5e4eb337daf02eeb4f440d5104b7bafe63e75a972bb5b79c616b6d2212b5
SimHash 0934895606b3

Groups

*

Rule Path
Disallow /Identity/*
Disallow /authentication/*
Disallow /*.cshtml$
Disallow /*.json$
Disallow /*.app.json$
Disallow /*.config$
Disallow /help/*
Disallow /t/*
Disallow /mylists/*
Disallow /account/*
Disallow /note/*
Disallow /message/*
Disallow /weather/*
Disallow /signin/*
Disallow /register/*
Disallow /emreq/*
Disallow /regcnf/*
Disallow /pwrcnf/*
Disallow /emccnf/*
Disallow /articles/*

Other Records

Field Value
crawl-delay 5

voltron

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://listjar.com/sitemap.xml

Comments

  • _
  • [ ]
  • ( )
  • |>|
  • __/===\__
  • //| o=o |\\
  • <] | o=o | [>
  • \=====/
  • / / | \ \
  • <_________>
  • Got a List? Put it in the Jar! ListJar!

Warnings

  • 2 invalid lines.