lfmodels.com
robots.txt

Robots Exclusion Standard data for lfmodels.com

Resource Scan

Scan Details

Site Domain lfmodels.com
Base Domain lfmodels.com
Scan Status Ok
Last Scan 2025-12-01T05:20:49+00:00
Next Scan 2025-12-31T05:20:49+00:00

Last Scan

Scanned 2025-12-01T05:20:49+00:00
URL https://lfmodels.com/robots.txt
Domain IPs 104.21.23.100, 172.67.210.118, 2606:4700:3034::6815:1764, 2606:4700:3034::ac43:d276
Response IP 104.21.23.100
Found Yes
Hash fe24dae8c3517c7a43d69b39d1ccfdd1dfce85cc90bd1f9e97202ef2cc0e65bc
SimHash 547151408cd4
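
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. As a minimal sketch, assuming the scanner hashes the raw response body with SHA-256 (the report does not say so), a change between two scans could be detected as follows. The function names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner's Hash field is SHA-256 over the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a freshly fetched body."""
    return body_digest(body) != previous_hash


if __name__ == "__main__":
    sample = b"User-agent: *\nAllow: /\n"  # illustrative body, not the real file
    stored = body_digest(sample)
    print(len(stored))                  # digest length in hex characters
    print(has_changed(stored, sample))  # same body, so no change is reported
```

An exact hash only reports whether the body is byte-for-byte identical. The separate SimHash field is presumably there to measure how similar two versions are, which this sketch does not attempt.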

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
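
To illustrate how the groups above are applied, here is a short sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hand-written reconstruction of a few of the listed groups, not the raw file, and the helper names are hypothetical. The second `*` group (`Disallow /cgi-bin/`) is left out on purpose, because parsers can differ in how they combine repeated groups for the same user agent.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's group listing; the real file may differ in
# casing, ordering, and comments.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: CCBot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, path: str) -> bool:
    """Check whether user_agent may fetch path on lfmodels.com."""
    return parser.can_fetch(user_agent, "https://lfmodels.com" + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for agent in ("GPTBot", "ClaudeBot", "ExampleBrowser"):
        print(agent, is_allowed(parser, agent, "/index.php"))
```

The named crawlers match their own groups and are refused everywhere, while an agent with no dedicated group falls back to the `*` group and is allowed.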

Comments

  • As a condition of accessing this website, you agree to abide by the following content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding use.
  • (b) If a content-signal = no, you may not collect content for the corresponding use.
  • (c) If the website operator does not include a content signal for a corresponding use, the website operator neither grants nor restricts permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning hyperlinks and short excerpts from your website's contents). Search does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval augmented generation, grounding, or other real-time taking of content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • NOTE: This is only an example file.
  • If you don't already have a robots.txt file, you may RENAME this one to robots.txt
  • **** IF YOU USE THIS SAMPLE FILE **** ... THEN YOU SHOULD remove all comments from this file simply by deleting any lines starting with a #
  • ***** Zen Cart doesn't require any specific exclusions for normal operation of Zen Cart activities. *****
  • ***** You can fine-tune or customize things according to your own needs related to search-engine results, but that is entirely outside the scope of normal Zen Cart operation. *****
  • If you wish to prevent indexing of your /images folder, add a line that says:
  • Disallow: /images
  • (but remove the # from the beginning of the line! )
  • And if you wish your popup pages to not be indexed, you can add this also:
  • Disallow: /index.php?main_page=popup_image*
  • (again, remove the # )
  • Do not list any private folders here ... otherwise their existence is no longer private.
  • Your robots.txt file should go in your /public_html or /httpdocs folder,
  • (even if your Zen Cart installation might be in a subfolder). Adjust any folder paths accordingly.
  • For additional reference on settings for robots.txt files, refer to:
  • http://www.robotstxt.org/wc/exclusion.html
  • http://en.wikipedia.org/wiki/Robots.txt
  • * Example robots.txt file
  • * @access private
  • * @license http://www.zen-cart.com/license/2_0.txt GNU Public License V2.0
  • * @version $Id: DrByte 2020 Jul 10 Modified in v1.5.8-alpha $

Warnings

  • 1 invalid line.
  • `content-signal` is not a known field.
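
The warning arises because `content-signal` is not one of the fields the scanner recognises, even though the file's comments define its meaning. As a rough sketch, a tool that wants to honour the signals could read such a line as below. The scan does not show the actual line, so the example value `search=yes, ai-train=no` and the comma-separated `name=value` layout are assumptions, and `parse_content_signal` is a hypothetical helper.

```python
def parse_content_signal(line: str) -> dict[str, bool]:
    """Parse one 'Content-Signal: name=yes, name=no' line into a dict.

    Assumption: signals are comma-separated name=value pairs whose values
    are 'yes' or 'no'. Malformed pairs are skipped rather than guessed at.
    """
    field, sep, value = line.partition(":")
    if not sep or field.strip().lower() != "content-signal":
        raise ValueError("not a content-signal line")

    signals: dict[str, bool] = {}
    for item in value.split(","):
        name, eq, setting = item.partition("=")
        name = name.strip().lower()
        setting = setting.strip().lower()
        if not eq or not name or setting not in ("yes", "no"):
            continue  # ignore anything that is not a clear yes/no signal
        signals[name] = setting == "yes"
    return signals


if __name__ == "__main__":
    # Hypothetical line; the real value is not shown in the scan report.
    print(parse_content_signal("Content-Signal: search=yes, ai-train=no"))
```

A signal that is absent from the returned dictionary corresponds to case (c) in the comments: permission is neither granted nor restricted for that use.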