howtolovecomics.com
robots.txt

Robots Exclusion Standard data for howtolovecomics.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	howtolovecomics.com
Base Domain	howtolovecomics.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-12-02T06:20:15+00:00
Next Scan	2026-01-01T06:20:15+00:00

Last Successful Scan

Scanned	2025-11-02T00:48:42+00:00
URL	https://howtolovecomics.com/robots.txt
Domain IPs	104.21.33.110, 172.67.161.225, 2606:4700:3035::6815:216e, 2606:4700:3037::ac43:a1e1
Response IP	104.21.33.110
Found	Yes
Hash	d6b3d040f14320094f7201683a2c9e612fc0875617d5dfe02b0235a898f07ae5
SimHash	420c4988e13f

Groups

*

Rule	Path
Disallow	/wp-content/uploads/wpo-plugins-tables-list.json

Rule

Path

Disallow

/wp-content/uploads/wpo-plugins-tables-list.json

criteobot/0.1

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-or/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-va/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-sg/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-ie/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-jp/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ias-au/3.3

Rule	Path
Disallow	/

Rule

Path

Disallow

ahc/2.1

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili/0.5

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

um-ln

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

mozlila/5.0

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

velenpublicwebcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

trafficbot.live

Rule	Path
Disallow	/

Rule

Path

Disallow

bot-traffic.icu

Rule	Path
Disallow	/

Rule

Path

Disallow

bottraffic.live

Rule	Path
Disallow	/

Rule

Path

Disallow

gammatraffic.com

Rule	Path
Disallow	/

Rule

Path

Disallow

trafficmarket.me

Rule	Path
Disallow	/

Rule

Path

Disallow

extratraffic.shop

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.howtolovecomics.com/sitemap_index.xml

Field

Value

sitemap

https://www.howtolovecomics.com/sitemap_index.xml

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK

howtolovecomics.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

criteobot/0.1

ias-or/3.3

ias-va/3.3

ias-sg/3.3

ias-ie/3.3

ias-jp/3.3

ias-au/3.3

ahc/2.1

omgili/0.5

chatgpt-user

proximic

um-ln

bytespider

mozlila/5.0

anthropic-ai

velenpublicwebcrawler

trafficbot.live

bot-traffic.icu

bottraffic.live

gammatraffic.com

trafficmarket.me

extratraffic.shop

google-extended

imagesiftbot

Other Records

Comments

howtolovecomics.com
robots.txt