Robots Exclusion Standard data for segre.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | segre.com |
Base Domain | segre.com |
Scan Status | Ok |
Last Scan | 2024-11-09T14:34:09+00:00 |
Next Scan | 2024-11-16T14:34:09+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-09T14:34:09+00:00 |
URL | https://segre.com/robots.txt |
Redirect | https://www.segre.com/robots.txt |
Redirect Domain | www.segre.com |
Redirect Base | segre.com |
Domain IPs | 18.154.41.70 |
Redirect IPs | 18.161.97.37, 18.161.97.38, 18.161.97.53, 18.161.97.60, 2600:9000:23d0:3400:1d:bf31:2680:93a1, 2600:9000:23d0:4400:1d:bf31:2680:93a1, 2600:9000:23d0:5e00:1d:bf31:2680:93a1, 2600:9000:23d0:7800:1d:bf31:2680:93a1, 2600:9000:23d0:7a00:1d:bf31:2680:93a1, 2600:9000:23d0:9200:1d:bf31:2680:93a1, 2600:9000:23d0:de00:1d:bf31:2680:93a1, 2600:9000:23d0:fa00:1d:bf31:2680:93a1 |
Response IP | 18.165.122.96 |
Found | Yes |
Hash | 4d4229302693ffed9909f84ab205c68b9e93d0bb980aeb78b8e129a453b4a03a |
SimHash | 8c08d25cdcb3 |
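The Hash value is 64 hexadecimal characters, which matches the length of a SHA-256 digest. Whether the scanner actually computes SHA-256 over the raw response body is an assumption; the report does not say. Under that assumption, a minimal sketch of fingerprinting a robots.txt body to detect changes between scans could look like this (the function name `content_fingerprint` is hypothetical):

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes of the response;
    this is not confirmed by the report.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return content_fingerprint(body) != previous_digest
```

Comparing the stored digest with a new one only says whether the file changed at all; a similarity measure such as the SimHash field would be needed to judge how much it changed.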
Groups
*
Rule | Path |
---|---|
Allow | / |
addthis.com disallow: /
admantx disallow: /
ahrefsbot disallow: /
anthropic-ai disallow: /
bdcbot disallow: /
bender disallow: /
bixocrawler disallow: /
bl.uk_lddc_bot disallow: /
blexbot disallow: /
bubing disallow: /
ccbot disallow: /
chatgpt-user disallow: /
cliqzbot disallow: /
cncdialer disallow: /
crawler4j disallow: /
crystalsemanticsbot disallow: /
cyberalert disallow: /
digext disallow: /
discobot disallow: /
discoverybot disallow: /
dloader disallow: /
dloader(naverrobot) disallow: /
doc disallow: /
dotbot disallow: /
download ninja disallow: /
dts agent disallow: /
exabot disallow: /
ezooms disallow: /
fairshare disallow: /
fetch disallow: /
flamingo_searchengine disallow: /
freshbot disallow: /
genieo disallow: /
gigabot disallow: /
gptbot disallow: /
grub-client disallow: /
heritrix disallow: /
heritrix/3.3.0 disallow: /
httrack disallow: /
ia_archiver disallow: /
integromedb disallow: /
istellabot disallow: /
jikespider disallow: /
jyxobot disallow: /
k2spider disallow: /
kimengi disallow: /
kimengi/nineconnections.com disallow: /
larbin disallow: /
lexxebot/1.0 disallow: /
libwww disallow: /
linko disallow: /
livelapbot disallow: /
magpie-crawler disallow: /
mail.ru disallow: /
maxthon disallow: /
metauri disallow: /
microsoft.url.control disallow: /
mj12bot disallow: /
moreover disallow: /
moreoverbot disallow: /
msiecrawler disallow: /
nabot disallow: /
naverbot disallow: /
nerdbynature.bot disallow: /
netestate ne crawler disallow: /
netseer crawler disallow: /
newscan disallow: /
nextgensearchbot disallow: /
npbot disallow: /
nutch disallow: /
offline explorer disallow: /
omgilibot disallow: /
orthogaffe disallow: /
piplbot disallow: /
pixray-seeker disallow: /
proximic disallow: /
psbot disallow: /
queryseekerspider disallow: /
rogerbot disallow: /
seokicks disallow: /
seokicks-robot disallow: /
sitebot disallow: /
sitebot/0.1 disallow: /
sitecheck.internetseer.com disallow: /
sitesnagger disallow: /
slurp disallow: /
sogou disallow: /
sosospider disallow: /
spbot disallow: /
spinn3r disallow: /
teleport disallow: /
teleportpro disallow: /
trendictionbot disallow: /
trovitbot disallow: /
turnitinbot disallow: /
ubicrawler disallow: /
umbot-ln disallow: /
unisterbot disallow: /
universalfeedparser disallow: /
wbsearchbot disallow: /
webcopier disallow: /
webreaper disallow: /
webstripper disallow: /
webzip disallow: /
wesee:search disallow: /
wget disallow: /
wotbot disallow: /
wotbox disallow: /
xenu disallow: /
yandex disallow: /
yasni disallow: /
zao disallow: /
zealbot disallow: /
zyborg disallow: /
claudebot disallow: /
claude-web disallow: /
No rules defined. All paths allowed.
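In the listing above, every user-agent token carries `disallow: /` as part of its name, and the group ends with "No rules defined." One plausible reading, which is an inference because the raw file is not reproduced here, is that each Disallow directive shares a line with its User-agent line, so a parser sees a run of user-agent names and no rules. The sketch below uses Python's standard `urllib.robotparser` to contrast that fused form with the conventional one-directive-per-line form. Both sample texts are hypothetical reconstructions using two agents from the list, not the site's actual file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction: the directive is fused onto the User-agent line.
FUSED = """\
User-agent: *
Allow: /

User-agent: gptbot Disallow: /
User-agent: claudebot Disallow: /
"""

# The same intent with each directive on its own line.
SEPARATED = """\
User-agent: *
Allow: /

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /
"""


def can_fetch(robots_text: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://www.segre.com/"
    for label, text in (("fused", FUSED), ("separated", SEPARATED)):
        print(label, "GPTBot allowed:", can_fetch(text, "GPTBot", url))
```

With this parser, the fused form leaves GPTBot covered only by the `*` group, so the fetch is allowed, while the separated form blocks it. Other parsers may treat the fused lines differently, so this illustrates one implementation's behavior rather than a statement about how every crawler reads the file.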
Other Records
Field | Value |
---|---|
sitemap | https://www.segre.com/sitemap-index.xml |
sitemap | https://www.segre.com/sitemap-google-news-ca.xml |
sitemap | https://www.segre.com/sitemap-google-news-es.xml |
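The sitemap records above can be read out of a robots.txt body with Python's standard `urllib.robotparser`, whose `site_maps()` method is available from Python 3.8. This is a minimal sketch: the sample text is a hypothetical excerpt built from the three URLs in the table, and the helper name `list_sitemaps` is an assumption made for illustration.

```python
from typing import List

from urllib.robotparser import RobotFileParser

# Hypothetical excerpt built from the sitemap records in the table above.
SAMPLE = """\
User-agent: *
Allow: /

Sitemap: https://www.segre.com/sitemap-index.xml
Sitemap: https://www.segre.com/sitemap-google-news-ca.xml
Sitemap: https://www.segre.com/sitemap-google-news-es.xml
"""


def list_sitemaps(robots_text: str) -> List[str]:
    """Return the Sitemap URLs declared in a robots.txt body, in file order."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for sitemap_url in list_sitemaps(SAMPLE):
        print(sitemap_url)
```

Sitemap records are independent of user-agent groups, so they are reported regardless of which group they appear next to in the file.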
Warnings
- 1 invalid line.