apollo.lv
robots.txt

Robots Exclusion Standard data for apollo.lv

Resource Scan

Scan Details

Site Domain apollo.lv
Base Domain apollo.lv
Scan Status Ok
Last Scan2024-11-07T08:05:03+00:00
Next Scan 2024-11-14T08:05:03+00:00

Last Scan

Scanned2024-11-07T08:05:03+00:00
URL https://apollo.lv/robots.txt
Redirect https://www.apollo.lv/robots.txt
Redirect Domain www.apollo.lv
Redirect Base apollo.lv
Domain IPs 185.154.221.187, 185.154.221.188
Redirect IPs 185.154.221.187, 185.154.221.188
Response IP 185.154.221.187
Found Yes
Hash a0623854a80ee177e18060291d535365c998b849cd1dc6f17efe5f4f6ea1e513
SimHash 0b0f46604ab1

Groups

*

Rule Path
Disallow /search*
Disallow /latest/*
Disallow /*/print/*
Disallow /print/*
Disallow /*/com/*
Disallow /mobile/*
Disallow /rest/*
Disallow /feed/*
Disallow /weather/*
Disallow /?schedule=*
Disallow /author/*

bingbot
msnbot
msnbot-media
yandexbot
ahrefsbot
seekportbot

Rule Path
Disallow /search*
Disallow /latest/*
Disallow /*/print/*
Disallow /print/*
Disallow /*/com/*
Disallow /mobile/*
Disallow /?schedule=*
Disallow /rest/*
Disallow /feed/*
Disallow /weather/*
Disallow /author/*

Other Records

Field Value
crawl-delay 60

mediapartners-google

Rule Path
Disallow

discobot
dotbot
yacybot
petalbot
semrushbot
grapeshot
barkrowler

Rule Path
Disallow *

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

femtosearchbot

Rule Path
Disallow *

Other Records

Field Value
sitemap https://www.apollo.lv/sitemap