truyencv.org
robots.txt

Robots Exclusion Standard data for truyencv.org

Resource Scan

Scan Details

Site Domain truyencv.org
Base Domain truyencv.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-04-13T12:14:56+00:00
Next Scan 2025-05-13T12:14:56+00:00

Last Successful Scan

Scanned2025-03-15T12:05:34+00:00
URL https://truyencv.org/robots.txt
Redirect https://metruyenchu.org/robots.txt
Redirect Domain metruyenchu.org
Redirect Base metruyenchu.org
Domain IPs 104.21.30.205, 172.67.173.221, 2606:4700:3031::6815:1ecd, 2606:4700:3034::ac43:addd
Redirect IPs 104.21.95.108, 172.67.144.162, 2606:4700:3036::6815:5f6c, 2606:4700:3037::ac43:90a2
Response IP 104.21.95.108
Found Yes
Hash e370901534ddc5f0e60b99f40561293ee65fe2a3ad5ab08a646f36720f0719f2
SimHash 32af6443f062

Groups

*

Rule Path
Allow /
Disallow /doubleclick/
Disallow /eyeblaster/
Disallow /tim-kiem/
Disallow /404/

sentibot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

advbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

publiclibraryarchive.org

Rule Path
Disallow /

memorybot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

abonti

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

mixbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

bubing

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

screenerbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

bpimagewalker/2.0

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

feedbooster

Rule Path
Disallow /

nutch

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

spbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

exb language crawler

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://metruyenchu.org/sitemap.xml

Comments

  • 2015.06.27 crawler for SentiOne
  • 2015.04.06 SEO indexer
  • 2015.02.10 AdvBot "classify web content"
  • 2015.01.30 XoviBot SEO bot
  • 2015.02.19 ??? parked domain
  • 2014.12.26. Internet Memory Research
  • 2014.09.26. SimilarTech, Lead Generation, Competitive Intelligence based on Web Tech Analysis
  • 2014.09.26. XOVI Suite, SEO & Online Marketing Tool
  • 2014.09.18. WebSearch
  • 2014.09.11. The web search API
  • entries without date
  • SEO services
  • panscient.com
  • tiscali.it search bot
  • search engine
  • search engine
  • Mixdata : data for big business
  • chinese search engine
  • chinese search engine
  • scalable, fully distributed crawler
  • ??? search engine
  • search engine
  • the Internet Archive's open-source, extensible, scalable, archival-quality Web crawler
  • kostenlose Backlinkchecker von Torsten R«äckert Internetdiestleistungen
  • part of Ware Bay Best Buys Search engine
  • Web crawler
  • analyses the structure of the WWW
  • search engine
  • seo
  • brand protection
  • seo
  • seo
  • search engine
  • seo
  • plagiarism check
  • search engine www.sengine.info
  • news
  • Apache Nutch based
  • news portal
  • seo moz
  • seo
  • seo
  • language