macaubusiness.com
robots.txt
Robots Exclusion Standard data for macaubusiness.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | macaubusiness.com |
Base Domain | macaubusiness.com |
Scan Status | Ok |
Last Scan | 2024-10-04T11:52:01+00:00 |
Next Scan | 2024-11-03T11:52:01+00:00 |
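The two timestamps above are ISO 8601 values in UTC, and the gap between them can be checked directly. The sketch below only does that arithmetic. The variable names are illustrative, and the report itself does not state a rescan policy, so the interval is an observation from these two values rather than a documented setting.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-10-04T11:52:01+00:00")
next_scan = datetime.fromisoformat("2024-11-03T11:52:01+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```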
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-04T11:52:01+00:00 |
URL | https://macaubusiness.com/robots.txt |
Domain IPs | 172.66.40.123, 172.66.43.133, 2606:4700:3108::ac42:287b, 2606:4700:3108::ac42:2b85 |
Response IP | 172.66.40.123 |
Found | Yes |
Hash | cc58bd38b038192e80ff61c8dcc090a4730ab748aa891fe30abc70999a362dcc |
SimHash | d369e3578a47 |
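The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a content fingerprint of this shape could be computed as sketched below. `robots_digest` and the sample body are hypothetical, and the sample will not reproduce the hash shown above.

```python
import hashlib

def robots_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, not the actual file served by the site.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = robots_digest(sample)

# Hash value copied from the scan record, used here only to compare lengths.
reported = "cc58bd38b038192e80ff61c8dcc090a4730ab748aa891fe30abc70999a362dcc"
print(len(digest) == len(reported))  # True
```

A digest like this changes completely on any byte-level edit, which is presumably why the record also carries a SimHash, a fingerprint intended to stay similar when the content changes only slightly.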
Groups
*
Rule | Path |
---|---|
Disallow | /wp-login.php |
Disallow | /wp-admin/ |
ahrefsbot
amazonbot
baiduspider
baiduspider-image
baiduspider-video
blackwidow
bytespider
chinaclaw
custo
dataforseobot
disco
dotbot
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
exabot
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
mj12bot
mojeekbot
navroad
nearsite
net\ vampire
netants
netspider
netvibes
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
petalbot
realdownload
reget
rogerbot
semrushbot
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
yandex
zeus
Rule | Path |
---|---|
Disallow | / |
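The two groups above can be exercised with Python's standard `urllib.robotparser`. The sketch below rebuilds a shortened robots.txt from the tables, keeping only two of the listed agents for brevity, so it is an approximation of the scanned file rather than a copy of it. The client name `ExampleBrowser` and the path `/news/` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the two groups in the scan record.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-login.php
Disallow: /wp-admin/

User-agent: ahrefsbot
User-agent: wget
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

base = "https://macaubusiness.com"

# A listed agent falls into the second group and is blocked everywhere.
print(parser.can_fetch("Wget/1.21", base + "/"))                 # False

# Any other agent falls into the "*" group: only the WordPress
# login and admin paths are off limits.
print(parser.can_fetch("ExampleBrowser", base + "/wp-admin/"))   # False
print(parser.can_fetch("ExampleBrowser", base + "/news/"))       # True
```

Agent matching in this parser is case-insensitive and uses the product token before the first `/`, which is why `Wget/1.21` matches the lowercase `wget` entry.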
Warnings
- 4 invalid lines.