lgbtru.com
robots.txt

Robots Exclusion Standard data for lgbtru.com

Resource Scan

Scan Details

Site Domain lgbtru.com
Base Domain lgbtru.com
Scan Status Ok
Last Scan2024-09-27T00:45:09+00:00
Next Scan 2024-10-27T00:45:09+00:00

Last Scan

Scanned2024-09-27T00:45:09+00:00
URL https://lgbtru.com/robots.txt
Domain IPs 142.132.196.168, 145.239.67.120, 172.104.232.45
Response IP 145.239.67.120
Found Yes
Hash 2df0a39e2042a543c413fc26d9fbda9e087aa9df1198bf5de805adc623c36b10
SimHash 9369b4fb4e0b

Groups

websauger
webfetch
surfbot
larbin
download demon
image stripper
grafula
ahrefsbot
webleacher
smartdownload
eyenetie
pcbrowser
jetcar
web sucker
mj12bot
webauto
zeus
webwhacker
leechftp
express webpictures
webcopier
wwwoffle
xaldon webspider
ahrefsbot/5.1
dotbot
emailwolf
flashget
voideye
go-ahead-got-it
superbot
midown tool
blackwidow
wget
indy library
image sucker
superhttp
teleport pro
papa foto
webzip
netspider
bot [email="craftbot@yahoo.com"]mailto:craftbot@yahoo.com[/email]
httrack
semrushbot/1.1~bl
webgo is
extractorpro
webstripper
getright
eirgrabber
interget
mister pix
exabot
website quester
getweb!
hmview
custo
nearsite
disco
ecatch
offline explorer
mass downloader
web image collector
joc web spider
realdownload
gigabot
takeout
linkpadbot
webreaper
sitesnagger
rogerbot
net vampire
netzip
reget
internet ninja
grabnet
navroad
pavuk
website extractor
netants
octopus
pagegrabber
emailsiphon
chinaclaw
semrushbot
widow
go!zilla
offline navigator

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20