unicourt.com
robots.txt

Robots Exclusion Standard data for unicourt.com

Resource Scan

Scan Details

Site Domain unicourt.com
Base Domain unicourt.com
Scan Status Ok
Last Scan 2024-10-20T05:16:40+00:00
Next Scan 2024-11-19T05:16:40+00:00

Last Scan

Scanned 2024-10-20T05:16:40+00:00
URL https://unicourt.com/robots.txt
Domain IPs 18.232.41.176, 18.233.109.115, 23.22.191.140, 34.195.19.118, 34.197.12.33
Response IP 34.195.19.118
Found Yes
Hash f9e2b91110233dce4e30090ea68ca92dee8df5c389bc436efbbc435b519a1021
SimHash 4319c34a9090

Groups

*

Rule Path
Disallow /ga/
Disallow /lp/
Disallow /form/
Disallow /search/*
Disallow /search
Disallow /case/sitemap
Disallow /courts/new-courtsystem-courthouse-sitemap.xml
Disallow /case/userSegment/*
Disallow /editorial*
Disallow /case/urc*
Disallow /case/removeRecord*

adsbot-google
adsbot-google-mobile

Rule Path
Allow /

neevabot
uptimerobot
orbbot
mj12bot
bytespider
google-extended
gptbot
chatgpt-user
ccbot
anthropic-ai
claude-web
yandex
sistrix
sistrix crawler
sistrix
seokicks-robot
searchmetricsbot
seodiver
dotbot
meanpathbot
backlinkcrawler
megaindex.ru
megaindex.com
screaming frog seo spider
piplbot
uptimebot
siteauditbot
semrushbot
semrushbot-sa
amazonbot
claudebot
omgilibot
omgili
facebookbot
imagesiftbot
cohere-ai
perplexitybot
brightbot
facebookexternalhit/1.1
facebookcatalog/1.0
bytedance
babbar.tech
diffbot
barkrowler
seekportbot
awariosmartbot/1.0
ahrefsbot
petalbot
dataforseobot
meta-externalagent

Rule Path
Disallow /
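The effect of the three groups above can be checked with a short script. This is a minimal sketch, not the scanner's own logic: it uses Python's standard urllib.robotparser, and ROBOTS_SAMPLE is a hypothetical partial reconstruction of the file built from the groups listed here (only a few of the blocked agents are included, and "examplebot" is a made-up agent name). Wildcard paths such as /search/* are left out because the standard-library parser matches plain path prefixes and does not expand "*".

```python
from urllib.robotparser import RobotFileParser

# Hypothetical partial reconstruction of the scanned file, based on the
# groups listed in this report. It is not the verbatim robots.txt.
ROBOTS_SAMPLE = """\
User-agent: *
Disallow: /ga/
Disallow: /lp/
Disallow: /form/
Disallow: /search
Disallow: /case/sitemap

User-agent: adsbot-google
User-agent: adsbot-google-mobile
Allow: /

User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SAMPLE)

# "examplebot" is an invented agent that falls through to the "*" group.
checks = [
    ("examplebot", "https://unicourt.com/"),
    ("examplebot", "https://unicourt.com/search/foo"),
    ("adsbot-google", "https://unicourt.com/search"),
    ("gptbot", "https://unicourt.com/"),
]
for agent, url in checks:
    print(agent, url, parser.can_fetch(agent, url))
```

Under these assumptions, an unlisted agent may fetch the home page but not /search, adsbot-google is allowed everywhere, and the agents in the last group are blocked from the whole site. Other parsers, including those that follow longest-match precedence or support wildcards, may resolve edge cases differently.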

Other Records

Field Value
sitemap https://unicourt.com/sitemap.xml
sitemap https://unicourt.com/sitemap-blog.xml
sitemap https://unicourt.com/sitemap-careers.xml
sitemap https://unicourt.com/courts-sitemap-index.xml
sitemap https://unicourt.com/case/other-sitemaps/redacted-cases-sitemap.xml
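The sitemap records above can be read back programmatically. The following is a small sketch that assumes Python 3.8 or later (where RobotFileParser.site_maps() is available) and assumes the records appear in the file as standard "Sitemap:" lines. SITEMAP_LINES is a hypothetical rendering of those lines, not a copy of the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rendering of the sitemap records as robots.txt lines.
SITEMAP_LINES = [
    "Sitemap: https://unicourt.com/sitemap.xml",
    "Sitemap: https://unicourt.com/sitemap-blog.xml",
    "Sitemap: https://unicourt.com/sitemap-careers.xml",
    "Sitemap: https://unicourt.com/courts-sitemap-index.xml",
    "Sitemap: https://unicourt.com/case/other-sitemaps/redacted-cases-sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns None when no Sitemap lines were found, so fall back
# to an empty list to keep the loop below safe.
sitemaps = sitemap_parser.site_maps() or []
for sitemap_url in sitemaps:
    print(sitemap_url)
```

Sitemap records are independent of the user-agent groups, so they are reported for every crawler regardless of which group applies to it.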

Comments

  • Block bots