fvnki.town
robots.txt

Robots Exclusion Standard data for fvnki.town

Archived Snapshots

Resource Scan

Scan Details

Site Domain	fvnki.town
Base Domain	fvnki.town
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't establish SSL connection.
Last Scan	2025-12-12T13:55:31+00:00
Next Scan	2026-03-12T13:55:31+00:00

Last Successful Scan

Scanned	2023-11-17T01:32:40+00:00
URL	https://fvnki.town/robots.txt
Domain IPs	46.139.3.54
Response IP	46.139.3.54
Found	Yes
Hash	76892531dbef8c5f3988713d3dc8db9cc84c661241f7cadaa7d698bd50e3ce29
SimHash	783bdb1ecd4c

Groups

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

wellknownbot

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/api/
Disallow	/auth/
Disallow	/oauth/
Disallow	/check_your_email
Disallow	/wait_for_approval
Disallow	/account_disabled
Disallow	/.well-known/
Disallow	/fileserver/
Disallow	/users/
Disallow	/emoji/
Disallow	/admin
Disallow	/user
Disallow	/settings/
Disallow	/about/suspended

Rule

Path

Disallow

/api/

Disallow

/auth/

Disallow

/oauth/

Disallow

/check_your_email

Disallow

/wait_for_approval

Disallow

/account_disabled

Disallow

/.well-known/

Disallow

/fileserver/

Disallow

/users/

Disallow

/emoji/

Disallow

/admin

Disallow

/user

Disallow

/settings/

Disallow

/about/suspended

Other Records

Field	Value
crawl-delay	500

Field

Value

crawl-delay

500

Comments

GoToSocial robots.txt -- to edit, see internal/web/robots.go
More info @ https://developers.google.com/search/docs/crawling-indexing/robots/intro
Before we commence, a giant fuck you to ChatGPT in particular.
https://platform.openai.com/docs/gptbot
As of September 2023, GPTBot and ChatGPT-User are equivalent. But there's no telling
when OpenAI might decide to change that, so block this one too.
And a giant fuck you to Google Bard and their other generative AI ventures too.
https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers
Block CommonCrawl. Used in training LLMs and specifically GPT-3.
https://commoncrawl.org/faq
Block Omgilike/Webz.io, a "Big Web Data" engine.
https://webz.io/blog/web-data/what-is-the-omgili-bot-and-why-is-it-crawling-your-website/
Block Faceboobot, because Meta.
https://developers.facebook.com/docs/sharing/bot
Well-known.dev crawler. Indexes stuff under /.well-known.
https://well-known.dev/about/
Rules for everything else.
API endpoints.
Auth/login endpoints.
Well-known endpoints.
Fileserver/media.
Fedi S2S API endpoints.
Settings panels.
Domain blocklist.

fvnki.townrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

gptbot

chatgpt-user

google-extended

ccbot

omgilibot

facebookbot

wellknownbot

*

Other Records

Comments

fvnki.town
robots.txt