indian.community
robots.txt

Robots Exclusion Standard data for indian.community

Archived Snapshots

Resource Scan

Scan Details

Site Domain	indian.community
Base Domain	indian.community
Scan Status	Ok
Last Scan	2026-03-18T23:28:52+00:00
Next Scan	2026-03-25T23:28:52+00:00

Last Scan

Scanned	2026-03-18T23:28:52+00:00
URL	https://indian.community/robots.txt
Domain IPs	104.26.14.138, 104.26.15.138, 172.67.71.177, 2606:4700:20::681a:e8a, 2606:4700:20::681a:f8a, 2606:4700:20::ac43:47b1
Response IP	172.67.71.177
Found	Yes
Hash	760d456352f0c13918d9e422f420708422e444d2c213cd574e5111c3d65fc866
SimHash	44354bd3c5d5

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

cloudflarebrowserrenderingcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-news

Rule	Path
Allow	/

Rule

Path

Allow

facebookexternalhit

Rule	Path
Allow	/

Rule

Path

Allow

googleother

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-image

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

facebookexternalhit

Rule	Path
Allow	/

Rule

Path

Allow

facebot

Rule	Path
Allow	/

Rule

Path

Allow

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-login.php
Disallow	/xmlrpc.php
Disallow	/cgi-bin/
Disallow	/?s=
Disallow	/?attachment_id=
Disallow	/trackback/
Disallow	/comments/feed/
Allow	/wp-admin/admin-ajax.php
Allow	/feed/
Allow	/*/feed/
Allow	/category/*/feed/

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-login.php

Disallow

/xmlrpc.php

Disallow

/cgi-bin/

Disallow

/?s=

Disallow

/?attachment_id=

Disallow

/trackback/

Disallow

/comments/feed/

Allow

/wp-admin/admin-ajax.php

Allow

/feed/

Allow

/*/feed/

Allow

/category/*/feed/

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://indian.community/sitemap_index.xml
sitemap	https://indian.community/news-sitemap.xml

Field

Value

sitemap

https://indian.community/sitemap_index.xml

sitemap

https://indian.community/news-sitemap.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a Content-Signal = yes, you may collect content for the corresponding
use.
(b) If a Content-Signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a Content-Signal for a
corresponding use, the website operator neither grants nor restricts
permission via Content-Signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
--- Core: allow search engines full access to content ---
--- WordPress admin hygiene ---
--- IMPORTANT: allow feeds (needed for News/Publisher Center & freshness) ---
--- Sitemaps ---
--- Optional: AI bot policy (keep if you want) ---
Block specific training bots if you must

Warnings

`content-signal` is not a known field.

indian.communityrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

cloudflarebrowserrenderingcrawler

google-extended

gptbot

meta-externalagent

googlebot

googlebot-news

facebookexternalhit

googleother

googlebot-image

bingbot

facebookexternalhit

facebot

*

gptbot

chatgpt-user

google-extended

claudebot

ccbot

bytespider

Other Records

Comments

Warnings

indian.community
robots.txt