it.openfoodfacts.org
robots.txt

Robots Exclusion Standard data for it.openfoodfacts.org

Resource Scan

Scan Details

Site Domain it.openfoodfacts.org
Base Domain openfoodfacts.org
Scan Status Ok
Last Scan2024-09-21T19:32:23+00:00
Next Scan 2024-10-05T19:32:23+00:00

Last Scan

Scanned2024-09-21T19:32:23+00:00
URL https://it.openfoodfacts.org/robots.txt
Domain IPs 213.36.253.214
Response IP 213.36.253.214
Found Yes
Hash 58f60fbfc43384282cd2d7c3feb9dd48784a79875d5f4b4899637f4e85c04f01
SimHash 485c8591d744

Groups

*

Rule Path
Allow /cgi/product_image.pl
Allow /cgi/opensearch.pl
Disallow /cgi
Disallow /code
Disallow /api
Disallow /additives
Disallow /allergen
Disallow /allergens
Disallow /amino-acid
Disallow /amino-acids
Disallow /brands
Disallow /categories
Disallow /categories-properties
Disallow /checker
Disallow /checkers
Disallow /city
Disallow /cities
Disallow /code
Disallow /codes
Disallow /corrector
Disallow /correctors
Disallow /country
Disallow /countries
Disallow /data-quality
Disallow /data-quality-bug
Disallow /data-quality-bugs
Disallow /data-quality-error
Disallow /data-quality-errors
Disallow /data-quality-error-producers
Disallow /data-quality-errors-producers
Disallow /data-quality-info
Disallow /data-quality-warning
Disallow /data-quality-warnings
Disallow /data-quality-warning-producers
Disallow /data-quality-warnings-producers
Disallow /data-source
Disallow /data-sources
Disallow /debug
Disallow /editor
Disallow /editors
Disallow /packager-code
Disallow /packager-codes
Disallow /entry-date
Disallow /entry-dates
Disallow /food-group
Disallow /food-groups
Disallow /import
Disallow /imports
Disallow /possible-improvement
Disallow /possible-improvements
Disallow /informer
Disallow /informers
Disallow /ingredient
Disallow /ingredients
Disallow /ingredients-analysis
Disallow /ingredients-from-palm-oil
Disallow /number-of-ingredients
Disallow /numbers-of-ingredients
Disallow /ingredient-original
Disallow /ingredients-original
Disallow /ingredients-that-may-be-from-palm-oil
Disallow /known-nutrient
Disallow /known-nutrients
Disallow /labels
Disallow /language
Disallow /languages
Disallow /last-check-date
Disallow /last-check-dates
Disallow /last-edit-date
Disallow /last-edit-dates
Disallow /last-image-date
Disallow /last-image-dates
Disallow /manufacturing-place
Disallow /manufacturing-places
Disallow /mineral
Disallow /minerals
Disallow /misc
Disallow /mission
Disallow /missions
Disallow /nova-groups
Disallow /nucleotide
Disallow /nucleotides
Disallow /nutrient-level
Disallow /nutrient-levels
Disallow /nutrient
Disallow /nutrients
Disallow /nutri-score
Disallow /nutri-score-2021
Disallow /nutri-score-2023
Disallow /nutrition-grades
Disallow /origin
Disallow /origins
Disallow /other-nutritional-substance
Disallow /other-nutritional-substances
Disallow /owner
Disallow /owners
Disallow /packaging
Disallow /packaging-materials
Disallow /packaging-recycling
Disallow /packaging-shape
Disallow /packaging-shapes
Disallow /period-after-opening
Disallow /periods-after-opening
Disallow /photographer
Disallow /photographers
Disallow /pnns-group-1
Disallow /pnns-groups-1
Disallow /pnns-group-2
Disallow /pnns-groups-2
Disallow /popularity
Disallow /purchase-place
Disallow /purchase-places
Disallow /quality
Disallow /state
Disallow /states
Disallow /store
Disallow /stores
Disallow /team
Disallow /teams
Disallow /trace
Disallow /traces
Disallow /unknown-nutrient
Disallow /unknown-nutrients
Disallow /contributor
Disallow /contributors
Disallow /vitamin
Disallow /vitamins
Disallow /weigher
Disallow /weighers

bingbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

cliqzbot/3.0

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seekport bot

Rule Path
Disallow /

seekport

Rule Path
Disallow /

paracrawl

Rule Path
Disallow /

scrapy/1.5.0

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

velenpublicwebcrawler (velen.io)

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

semrushbot/2~bl

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

yandexmarket

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Comments

  • Disallow: Bingbot (temporary test)
  • Disallow: SEOkicks-Robot
  • http://www.opensiteexplorer.org/dotbot
  • http://www.searchmetrics.com
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://www.domaintools.com/webmasters/surveybot.php
  • http://www.seodiver.com/bot
  • http://openlinkprofiler.org/bot
  • http://www.wotbox.com/bot/