fpdf.cc
robots.txt

Robots Exclusion Standard data for fpdf.cc

Archived Snapshots

Resource Scan

Scan Details

Site Domain	fpdf.cc
Base Domain	fpdf.cc
Scan Status	Ok
Last Scan	2025-12-05T14:43:55+00:00
Next Scan	2025-12-12T14:43:55+00:00

Last Scan

Scanned	2025-12-05T14:43:55+00:00
URL	https://fpdf.cc/robots.txt
Domain IPs	104.21.73.222, 172.67.167.89, 2606:4700:3031::6815:49de, 2606:4700:3034::ac43:a759
Response IP	104.21.73.222
Found	Yes
Hash	301d62ecfc5121e9845b210660d0008202779cf83e2b2facba23ec84e1ecb173
SimHash	44f59b42c5d5

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	0.5

Field

Value

crawl-delay

0.5

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

slurp

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

duckduckbot

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

baiduspider

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

yandexbot

Rule	Path
Allow	/
Disallow	/.git/
Disallow	/node_modules/
Disallow	/src/
Disallow	/.env
Disallow	/package.json
Disallow	/package-lock.json
Disallow	/yarn.lock
Disallow	/pnpm-lock.yaml
Disallow	/tsconfig.json
Disallow	/vite.config.ts
Disallow	/tailwind.config.js
Disallow	/postcss.config.js
Disallow	/eslint.config.js
Disallow	/.vercel/
Disallow	/.trae/
Allow	/images/
Allow	/favicon.ico
Allow	/favicon.svg
Allow	/ads.txt
Allow	/sitemap.xml
Allow	/robots.txt

Rule

Path

Allow

Disallow

/.git/

Disallow

/node_modules/

Disallow

/src/

Disallow

/.env

Disallow

/package.json

Disallow

/package-lock.json

Disallow

/yarn.lock

Disallow

/pnpm-lock.yaml

Disallow

/tsconfig.json

Disallow

/vite.config.ts

Disallow

/tailwind.config.js

Disallow

/postcss.config.js

Disallow

/eslint.config.js

Disallow

/.vercel/

Disallow

/.trae/

Allow

/images/

Allow

/favicon.ico

Allow

/favicon.svg

Allow

/ads.txt

Allow

/sitemap.xml

Allow

/robots.txt

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://fpdf.cc/sitemap.xml

Field

Value

sitemap

https://fpdf.cc/sitemap.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a content-signal = yes, you may collect content for the corresponding
use.
(b) If a content-signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a content signal for a
corresponding use, the website operator neither grants nor restricts
permission via content signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
Robots.txt for fpdf.cc
https://fpdf.cc/robots.txt
Last updated: 2024-12-19
Allow all web crawlers to access all content
Sitemap locations
Specific rules for major search engines
Block access to development and system files
Allow access to important files and directories
Block common bot patterns that might waste resources
Host directive for canonical domain

Warnings

`content-signal` is not a known field.
`host` is not a known field.

fpdf.ccrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

*

Other Records

googlebot

Other Records

bingbot

Other Records

slurp

Other Records

duckduckbot

Other Records

baiduspider

Other Records

yandexbot

Other Records

ahrefsbot

mj12bot

dotbot

semrushbot

Other Records

Comments

Warnings

fpdf.cc
robots.txt