apnarena.com
robots.txt

Robots Exclusion Standard data for apnarena.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	apnarena.com
Base Domain	apnarena.com
Scan Status	Ok
Last Scan	2026-03-30T22:04:05+00:00
Next Scan	2026-04-06T22:04:05+00:00

Last Scan

Scanned	2026-03-30T22:04:05+00:00
URL	https://apnarena.com/robots.txt
Domain IPs	104.21.75.20, 172.67.210.60, 2606:4700:3032::ac43:d23c, 2606:4700:3036::6815:4b14
Response IP	172.67.210.60
Found	Yes
Hash	8e869326d72873fb92018d87a30a647b06c03797a4f62ac9a71b1e951fd3be25
SimHash	c765a3538c57

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

cloudflarebrowserrenderingcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

*
mediapartners-google

Rule	Path
Allow	/ads/preferences/
Allow	/gpt/
Allow	/pagead/show_ads.js
Allow	/pagead/js/adsbygoogle.js
Allow	/pagead/js/*/show_ads_impl.js

Rule

Path

Allow

/ads/preferences/

Allow

/gpt/

Allow

/pagead/show_ads.js

Allow

/pagead/js/adsbygoogle.js

Allow

/pagead/js/*/show_ads_impl.js

googlebot

Rule	Path
Disallow

Rule

Path

Disallow

*

Rule	Path
Disallow
Disallow	/cgi-bin/
Disallow	*/?no_cache=1
Disallow	/?ez_force_cookie_consent=1
Disallow	/embed/
Disallow	*/?expand_article=1
Disallow	/?*
Disallow	/detroitchicago/
Disallow	/porpoiseant/
Disallow	/beardeddragon/
Disallow	/tardisrocinante/
Disallow	/parsonsmaize/
Disallow	/edomontonalberta/
Disallow	/ezais/

Rule

Path

Disallow

/cgi-bin/

Disallow

*/?no_cache=1

Disallow

/?ez_force_cookie_consent=1

Disallow

/embed/

Disallow

*/?expand_article=1

Disallow

/?*

Disallow

/detroitchicago/

Disallow

/porpoiseant/

Disallow

/beardeddragon/

Disallow

/tardisrocinante/

Disallow

/parsonsmaize/

Disallow

/edomontonalberta/

Disallow

/ezais/

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus
gptbot
semrushbot
bytespider
amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a Content-Signal = yes, you may collect content for the corresponding
use.
(b) If a Content-Signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a Content-Signal for a
corresponding use, the website operator neither grants nor restricts
permission via Content-Signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content

Warnings

`content-signal` is not a known field.

apnarena.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

cloudflarebrowserrenderingcrawler

google-extended

gptbot

meta-externalagent

*mediapartners-google

googlebot

*

Comments

Warnings

apnarena.com
robots.txt

*
mediapartners-google