revita.bg
robots.txt

Robots Exclusion Standard data for revita.bg

Archived Snapshots

Resource Scan

Scan Details

Site Domain	revita.bg
Base Domain	revita.bg
Scan Status	Ok
Last Scan	2024-09-22T07:50:25+00:00
Next Scan	2024-10-22T07:50:25+00:00

Last Scan

Scanned	2024-09-22T07:50:25+00:00
URL	https://revita.bg/robots.txt
Domain IPs	79.98.106.85
Response IP	79.98.106.85
Found	Yes
Hash	cf0c59aab61691c8b2c6409130d37f2b45a9b27b74fc1c13e92964ba723ad744
SimHash	98105d198524

Groups

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

geedoproductsearch

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

friendlycrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

image2dataset

Rule	Path
Disallow	/

Rule

Path

Disallow

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

barkrowler

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

censysinspect

Rule	Path
Disallow	/

Rule

Path

Disallow

expanse

Rule	Path
Disallow	/

Rule

Path

Disallow

internet-measurement

Rule	Path
Disallow	/

Rule

Path

Disallow

dataprovider

Rule	Path
Disallow	/

Rule

Path

Disallow

dalvik/2.1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

go-http-client

Rule	Path
Disallow	/

Rule

Path

Disallow

ioncrawl

Rule	Path
Disallow	/

Rule

Path

Disallow

isscyberriskcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

java

Rule	Path
Disallow	/

Rule

Path

Disallow

mozlila

Rule	Path
Disallow	/

Rule

Path

Disallow

orbbot

Rule	Path
Disallow	/

Rule

Path

Disallow

python-requests

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

bw/1.1

Rule	Path
Disallow	/

Rule

Path

Disallow

wp_is_mobile

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Allow	/modules/.css
Allow	/modules/.js
Allow	/modules/.png
Allow	/modules/.jpg
Allow	/js/jquery/*

Rule

Path

Allow

*/modules/*.css

Allow

*/modules/*.js

Allow

*/modules/*.png

Allow

*/modules/*.jpg

Allow

/js/jquery/*

Other Records

Field	Value
sitemap	https://revita.bg/sitemap_blog.xml
sitemap	https://revita.bg/sitemap-index.xml
sitemap	https://revita.bg/sitemap-with-images-index.xml

Field

Value

sitemap

https://revita.bg/sitemap_blog.xml

sitemap

https://revita.bg/sitemap-index.xml

sitemap

https://revita.bg/sitemap-with-images-index.xml

Comments

robots.txt automatically generated by PrestaShop e-commerce open-source solution
http://www.prestashop.com - http://www.prestashop.com/forums
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
Blocking some bad bots
Allow Directives
Sitemaps

revita.bgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

claudebot

geedoproductsearch

anthropic-ai

bytespider

ccbot

chatgpt-user

friendlycrawler

image2dataset

imagesiftbot

omgilibot

barkrowler

blexbot

dataforseobot

dotbot

mj12bot

censysinspect

expanse

internet-measurement

dataprovider

dalvik/2.1.0

go-http-client

ioncrawl

isscyberriskcrawler

java

mozlila

orbbot

python-requests

scrapy

bw/1.1

wp_is_mobile

zoominfobot

*

Other Records

Comments

revita.bg
robots.txt