healthybiteshq.com
robots.txt

Robots Exclusion Standard data for healthybiteshq.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	healthybiteshq.com
Base Domain	healthybiteshq.com
Scan Status	Ok
Last Scan	2024-11-14T07:37:28+00:00
Next Scan	2024-11-21T07:37:28+00:00

Last Scan

Scanned	2024-11-14T07:37:28+00:00
URL	https://healthybiteshq.com/robots.txt
Domain IPs	192.250.234.179
Response IP	192.250.234.179
Found	Yes
Hash	74865349c3c3a17ae0ca1aea54eb9bcbed0d1d688f4f197993751628cd145a53
SimHash	5769f2538f63

Groups

*

Rule	Path
Allow	/
Allow	/wp-admin/admin-ajax.php
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/tag/
Disallow	/author/
Disallow	/category/
Disallow	/attachments/
Disallow	/archives/
Disallow	/?
Disallow	/dir/

Rule

Path

Allow

/wp-admin/admin-ajax.php

Disallow

/wp-admin/

Disallow

/wp-includes/

Disallow

/tag/

Disallow

/author/

Disallow

/category/

Disallow

/attachments/

Disallow

/archives/

Disallow

/*?*

Disallow

/dir/

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

neevabot

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

adsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

riddler

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/
Disallow	/?utm_source=

Rule

Path

Disallow

/?utm_source=

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot/2.0

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot/2.0 (http://commoncrawl.org/faq/)

Rule	Path
Disallow	/

Rule

Path

Disallow

wikido

Rule	Path
Disallow	/

Rule

Path

Disallow

fr_crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-image

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-video

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-favo

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-news

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-cpro

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider-ads

Rule	Path
Disallow	/

Rule

Path

Disallow

trendictionbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bitvorebot

Rule	Path
Disallow	/

Rule

Path

Disallow

blp_bbot

Rule	Path
Disallow	/

Rule

Path

Disallow

heritrix

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

kraken

Rule	Path
Disallow	/

Rule

Path

Disallow

moatbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bhcbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

synthesio

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

brandonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

germcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou

Rule	Path
Disallow	/

Rule

Path

Disallow

exabot

Rule	Path
Disallow	/

Rule

Path

Disallow

maxpointcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

admantx

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

Other Records

Field	Value
sitemap	https://healthybiteshq.com/sitemap.xml

Field

Value

sitemap

https://healthybiteshq.com/sitemap.xml

Comments

New crawlers to block 2016

Warnings

`host` is not a known field.

healthybiteshq.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

turnitinbot

mj12bot

blexbot

dotbot

neevabot

dataforseobot

adsbot

riddler

petalbot

ccbot

ccbot/2.0

ccbot/2.0 (http://commoncrawl.org/faq/)

wikido

fr_crawler

yandex

baiduspider

baiduspider-image

baiduspider-video

baiduspider-favo

baiduspider-news

baiduspider-cpro

baiduspider-ads

trendictionbot

bitvorebot

blp_bbot

heritrix

magpie-crawler

kraken

moatbot

bhcbot

semrushbot

synthesio

ahrefsbot

brandonbot

germcrawler

sogou

exabot

maxpointcrawler

admantx

Other Records

Other Records

Comments

Warnings

healthybiteshq.com
robots.txt