databox.pt
robots.txt

Robots Exclusion Standard data for databox.pt

Resource Scan

Scan Details

Site Domain databox.pt
Base Domain databox.pt
Scan Status Ok
Last Scan 2025-05-21T11:41:18+00:00
Next Scan 2025-06-20T11:41:18+00:00

Last Scan

Scanned 2025-05-21T11:41:18+00:00
URL https://databox.pt/robots.txt
Domain IPs 195.23.61.72
Response IP 195.23.61.72
Found Yes
Hash 3e96bcb060d0f2cec0db7390e93c98916e968f37228b9315360d92181c13a1d9
SimHash 5f1642684488

Groups

bingbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mj12bot/v1.4.0

Rule Path
Disallow /

mj12bot/v1.2.4

Rule Path
Disallow /

mj12bot/v1.2.3

Rule Path
Disallow /

mj12bot/v1.0.8

Rule Path
Disallow /

mj12bot/v1.0.7

Rule Path
Disallow /

mj12bot/v1.0.6

Rule Path
Disallow /

mj12bot/v1.0.5

Rule Path
Disallow /

mj12bot/v1.4.8

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

buck/2.2

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

mozilla/5.0+(compatible;+seekport+crawler;+http://seekport.com/)

Rule Path
Disallow /

duckduckbot/1.0; (+http://duckduckgo.com/duckduckbot.html)

Rule Path
Disallow /

duckduckbot-https/1.1;+(+https://duckduckgo.com/duckduckbot)

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

facebookexternalhit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

facebookexternalhit/1.1

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

facebookexternalhit/*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

*

Rule Path
Disallow /umbraco/
Disallow /views/
Disallow /config/
Disallow /bin/
Disallow /App_Plugins/
Disallow /App_Code/
Disallow /App_Data/
Disallow /Umbraco_Client/
Disallow /configurations/
Disallow /Views_Plugins/
Disallow /Media/
Disallow /lib/

Other Records

Field Value
crawl-delay 20
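
Example: evaluating these groups

The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT string is a hypothetical reconstruction of three of the reported groups (gptbot, facebookexternalhit, and *), not the actual file served at https://databox.pt/robots.txt, which may differ in ordering, casing, and wording. The helper name is_allowed and the agent name SomeOtherBot are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of three groups from the scan report.
# The real robots.txt may differ; this is for illustration only.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: facebookexternalhit
Crawl-delay: 120

User-agent: *
Crawl-delay: 20
Disallow: /umbraco/
Disallow: /Media/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the sample rules."""
    return parser.can_fetch(agent, "https://databox.pt" + path)


if __name__ == "__main__":
    for agent in ("GPTBot", "facebookexternalhit", "SomeOtherBot"):
        print(agent, is_allowed(agent, "/"), is_allowed(agent, "/umbraco/"),
              parser.crawl_delay(agent))
```

In this sketch, a crawler with its own group (such as facebookexternalhit) is matched against that group only, so the Disallow rules under * do not apply to it. This matches the report's "No rules defined. All paths allowed." note for those groups. Path matching in this parser is a case-sensitive prefix match, so /Media/ and /media/ are treated as different paths.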