triblive.com
robots.txt

Robots Exclusion Standard data for triblive.com

Resource Scan

Scan Details

Site Domain triblive.com
Base Domain triblive.com
Scan Status Ok
Last Scan 2024-10-29T17:33:35+00:00
Next Scan 2024-11-05T17:33:35+00:00

Last Scan

Scanned 2024-10-29T17:33:35+00:00
URL https://www.triblive.com/robots.txt
Domain IPs 13.58.26.36, 18.219.63.182
Response IP 13.58.26.36
Found Yes
Hash c08587be13d3efbba7af3c6a53c6ceac039d86e8678dce71a8f0dced05af84ba
SimHash d369f35d4f5f
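
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not state which algorithm is used or exactly which bytes are hashed, so the following is only a sketch under that assumption. The function name `robots_fingerprint` and the sample body are hypothetical. It shows how a content hash can be used to detect that a robots.txt file changed between scans.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 over the unmodified response body. The scan
    report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies from two scans (not the real triblive.com file).
previous_body = b"User-agent: *\nAllow: /\n"
current_body = b"User-agent: *\nDisallow: /\n"

# A differing digest signals that the file changed between scans.
changed = robots_fingerprint(previous_body) != robots_fingerprint(current_body)
print("changed:", changed)
```

An exact hash only answers "identical or not"; a similarity hash such as the SimHash field is the usual complement for judging how much a file changed, though this report does not describe how its SimHash is computed.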

Groups

*
mediapartners-google

Rule Path
Allow /

twitterbot

Rule Path
Disallow
Allow *

addthis
ahrefsbot
amazonadbot
archivebot
awariosmartbot
baiduspider
blackwidow
blexbot
ccbot
coccocbot
chinaclaw
clickagy
cliqzbot
custo
dotbot
demandbasepublisheranalyzer
disco
download\ demon
ecatch
exabot
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
grapeshot
grapeshotcrawler
grabnet
grafula
grammarly
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
linkisbot
mass\ downloader
mediawords
midown\ tool
mister\ pix
monsidobot
mj12bot
navroad
nearsite
netants
netspider
net\ vampire
netzip
ntentbot
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
proximic
pu_in
qwantify
realdownload
reget
riddler
rogerbot
scrapy
serpstatbot
semrushbot
semrushbot-sa
sitesnagger
sogou
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
the\ knowledge\ ai
trendictionbot
turnitinbot
tweetmemebot
voideye
velenpublicwebcrawler
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
yandex
zeus

Rule Path
Disallow /
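
The effect of the three groups above can be checked with Python's standard-library `urllib.robotparser`. This is a minimal sketch: the robots.txt text is a reconstructed excerpt with an assumed layout and only three of the blocked agents, not the site's actual file, and the `/news/` URL and the `allowed` helper are hypothetical. Note that `urllib.robotparser` uses first-match rule ordering and substring matching on agent names, so other crawlers may interpret the same file somewhat differently.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumed layout); only three blocked agents shown.
EXCERPT = """\
User-agent: *
User-agent: mediapartners-google
Allow: /

User-agent: twitterbot
Disallow:
Allow: *

User-agent: ahrefsbot
User-agent: ccbot
User-agent: wget
Disallow: /

Sitemap: https://triblive.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())


def allowed(agent: str, url: str = "https://triblive.com/news/") -> bool:
    """Return True if `agent` may fetch `url` under the excerpt's rules."""
    return parser.can_fetch(agent, url)


# Googlebot is not named, so it falls through to the "*" group.
for agent in ("Googlebot", "twitterbot", "ahrefsbot", "ccbot", "wget"):
    print(agent, "allowed" if allowed(agent) else "blocked")
```

Agents that are not named in any group fall back to the `*` group, which allows everything; the long list of named agents is blocked from the whole site by the single `Disallow /` rule.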

Other Records

Field Value
sitemap https://triblive.com/sitemap.xml

Comments

  • Updated 08/10/21

Warnings

  • 1 invalid line.