pressenet.info
robots.txt

Robots Exclusion Standard data for pressenet.info

Resource Scan

Scan Details

Site Domain pressenet.info
Base Domain pressenet.info
Scan Status Ok
Last Scan2025-05-24T04:55:59+00:00
Next Scan 2025-05-31T04:55:59+00:00

Last Scan

Scanned2025-05-24T04:55:59+00:00
URL https://pressenet.info/robots.txt
Redirect https://www.pressenet.info/robots.txt
Redirect Domain www.pressenet.info
Redirect Base pressenet.info
Domain IPs 2001:8d8:100f:f000::290, 217.160.0.203
Redirect IPs 2001:8d8:100f:f000::290, 217.160.0.203
Response IP 217.160.0.203
Found Yes
Hash d4fd672cd2f604e44132e0dc2d55ef01fbcaf64aab08c78073811f8c7ac05886
SimHash 4b3577d0f609

Groups

adsbot-google
adsbot-google-mobile
applebot
bingbot
bingbot/2.0
exabot
feedfetcher-google
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googleweblight
google-sitemaps
google-searchbyimage
mediapartners-google
msnbot
slurp
teoma

Rule Path
Disallow

ahrefsbot
baiduspider
beautybot
blexbot
ccbot/2.0
chatgpt-user
discobot
gptbot
ia_archiver
infotigerbot/1.9
ioncrawl
linguee
mail.ru
megaindex.ru
megaindex.ru/2.0
mj12bot/v1.4.8
perplexitybot
petalbot
petalbot mobile
seekport crawler
semrushbot
serpstatbot
sitelockspider
twitterbot
yandex
zoominfobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pressenet.info/sitemap-texte.xml