spicyip.com
robots.txt

Robots Exclusion Standard data for spicyip.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	spicyip.com
Base Domain	spicyip.com
Scan Status	Ok
Last Scan	2024-10-21T03:30:41+00:00
Next Scan	2024-11-20T03:30:41+00:00

Last Scan

Scanned	2024-10-21T03:30:41+00:00
URL	https://spicyip.com/robots.txt
Domain IPs	104.21.79.91, 172.67.143.12, 2606:4700:3030::ac43:8f0c, 2606:4700:3034::6815:4f5b
Response IP	172.67.143.12
Found	Yes
Hash	50f26699383dddd495b4cf96b953731222e186ff492469258d301aa6c00db3bd
SimHash	7b94535acb74

Groups

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

scoutjet

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

alexibot

Rule	Path
Disallow	/

Rule

Path

Disallow

aqua_products

Rule	Path
Disallow	/

Rule

Path

Disallow

asterias

Rule	Path
Disallow	/

Rule

Path

Disallow

b2w/0.1

Rule	Path
Disallow	/

Rule

Path

Disallow

backdoorbot/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

blowfish/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

bookmark search tool

Rule	Path
Disallow	/

Rule

Path

Disallow

botalot

Rule	Path
Disallow	/

Rule

Path

Disallow

botrighthere

Rule	Path
Disallow	/

Rule

Path

Disallow

builtbottough

Rule	Path
Disallow	/

Rule

Path

Disallow

bullseye/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

bunnyslippers

Rule	Path
Disallow	/

Rule

Path

Disallow

cheesebot

Rule	Path
Disallow	/

Rule

Path

Disallow

cherrypicker

Rule	Path
Disallow	/

Rule

Path

Disallow

cherrypickerelite/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

cherrypickerse/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

copernic

Rule	Path
Disallow	/

Rule

Path

Disallow

copyrightcheck

Rule	Path
Disallow	/

Rule

Path

Disallow

cosmos

Rule	Path
Disallow	/

Rule

Path

Disallow

crescent internet toolpak http ole control v.1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

crescent

Rule	Path
Disallow	/

Rule

Path

Disallow

dittospyder

Rule	Path
Disallow	/

Rule

Path

Disallow

emailcollector

Rule	Path
Disallow	/

Rule

Path

Disallow

emailsiphon

Rule	Path
Disallow	/

Rule

Path

Disallow

emailwolf

Rule	Path
Disallow	/

Rule

Path

Disallow

erocrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

extractorpro

Rule	Path
Disallow	/

Rule

Path

Disallow

fairad client

Rule	Path
Disallow	/

Rule

Path

Disallow

flaming attackbot

Rule	Path
Disallow	/

Rule

Path

Disallow

foobot

Rule	Path
Disallow	/

Rule

Path

Disallow

gaisbot

Rule	Path
Disallow	/

Rule

Path

Disallow

getright/4.2

Rule	Path
Disallow	/

Rule

Path

Disallow

harvest/1.5

Rule	Path
Disallow	/

Rule

Path

Disallow

hloader

Rule	Path
Disallow	/

Rule

Path

Disallow

httplib

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack 3.0

Rule	Path
Disallow	/

Rule

Path

Disallow

humanlinks

Rule	Path
Disallow	/

Rule

Path

Disallow

infonavirobot

Rule	Path
Disallow	/

Rule

Path

Disallow

iron33/1.0.2

Rule	Path
Disallow	/

Rule

Path

Disallow

jennybot

Rule	Path
Disallow	/

Rule

Path

Disallow

kenjin spider

Rule	Path
Disallow	/

Rule

Path

Disallow

keyword density/0.9

Rule	Path
Disallow	/

Rule

Path

Disallow

larbin

Rule	Path
Disallow	/

Rule

Path

Disallow

lexibot

Rule	Path
Disallow	/

Rule

Path

Disallow

libweb/clshttp

Rule	Path
Disallow	/

Rule

Path

Disallow

linkextractorpro

Rule	Path
Disallow	/

Rule

Path

Disallow

linkscan/8.1a unix

Rule

Path

Disallow

linkwalker

Rule

Path

Disallow

lnspiderguy

Rule

Path

Disallow

lwp-trivial/1.34

Rule

Path

Disallow

lwp-trivial

Rule

Path

Disallow

mata hari

Rule

Path

Disallow

microsoft url control - 5.01.4511

Rule

Path

Disallow

microsoft url control - 6.00.8169

Rule

Path

Disallow

microsoft url control

Rule

Path

Disallow

miixpc/4.2

Rule

Path

Disallow

miixpc

Rule

Path

Disallow

mister pix

Rule

Path

Disallow

moget/2.1

Rule

Path

Disallow

moget

Rule

Path

Disallow

mozilla/4.0 (compatible; bullseye; windows 95)

Rule

Path

Disallow

msiecrawler

Rule

Path

Disallow

netants

Rule

Path

Disallow

nicerspro

Rule

Path

Disallow

offline explorer

Rule

Path

Disallow

openbot

Rule

Path

Disallow

openfind data gatherer

Rule

Path

Disallow

openfind

Rule

Path

Disallow

oracle ultra search

Rule

Path

Disallow

perman

Rule

Path

Disallow

propowerbot/2.14

Rule

Path

Disallow

prowebwalker

Rule

Path

Disallow

psbot

Rule

Path

Disallow

python-urllib

Rule

Path

Disallow

queryn metasearch

Rule

Path

Disallow

radiation retriever 1.1

Rule

Path

Disallow

repomonkey bait & tackle/v1.01

Rule

Path

Disallow

repomonkey

Rule

Path

Disallow

rma

Rule

Path

Disallow

searchpreview

Rule

Path

Disallow

sitesnagger

Rule

Path

Disallow

spankbot

Rule

Path

Disallow

spanner

Rule

Path

Disallow

suzuran

Rule

Path

Disallow

szukacz/1.4

Rule

Path

Disallow

teleport

Rule

Path

Disallow

teleportpro

Rule

Path

Disallow

telesoft

Rule

Path

Disallow

the intraformant

Rule

Path

Disallow

thenomad

Rule

Path

Disallow

tighttwatbot

Rule

Path

Disallow

tocrawl/urldispatcher

Rule

Path

Disallow

true_robot/1.0

Rule

Path

Disallow

true_robot

Rule

Path

Disallow

turingos

Rule

Path

Disallow

turnitinbot/1.5

Rule

Path

Disallow

turnitinbot

Rule

Path

Disallow

url control

Rule

Path

Disallow

url_spider_pro

Rule

Path

Disallow

urly warning

Rule

Path

Disallow

vci webviewer vci webviewer win32

Rule

Path

Disallow

vci

Rule

Path

Disallow

web image collector

Rule

Path

Disallow

webauto

Rule

Path

Disallow

webbandit/3.50

Rule

Path

Disallow

webbandit

Rule

Path

Disallow

webcapture 2.0

Rule

Path

Disallow

webcopier v.2.2

Rule

Path

Disallow

webcopier v3.2a

Rule

Path

Disallow

webcopier

Rule

Path

Disallow

webenhancer

Rule

Path

Disallow

websauger

Rule

Path

Disallow

website quester

Rule

Path

Disallow

webster pro

Rule

Path

Disallow

webstripper

Rule

Path

Disallow

webzip/4.0

Rule

Path

Disallow

webzip/4.21

Rule

Path

Disallow

webzip/5.0

Rule

Path

Disallow

webzip

Rule

Path

Disallow

wget/1.5.3

Rule

Path

Disallow

wget/1.6

Rule

Path

Disallow

wget

Rule

Path

Disallow

wget

Rule

Path

Disallow

www-collector-e

Rule

Path

Disallow

xenu's link sleuth 1.1c

Rule

Path

Disallow

xenu's

Rule

Path

Disallow

zeus 32297 webster pro v2.9 win32

Rule

Path

Disallow

zeus link scout

Rule

Path

Disallow

zeus

Rule

Path

Disallow

adsbot-google

Rule

Path

Disallow

googlebot

Rule

Path

Disallow

mediapartners-google

Rule

Path

Disallow

*

Rule

Path

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/INSTALL.sqlite.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/xmlrpc.php

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/wp-includes/

Disallow

/filter/tips/

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register/

Disallow

/user/password/

Disallow

/user/login/

Disallow

/user/logout/

Disallow

/?q=admin%2F

Disallow

/?q=comment%2Freply%2F

Disallow

/?q=filter%2Ftips%2F

Disallow

/?q=node%2Fadd%2F

Disallow

/?q=search%2F

Disallow

/?q=user%2Fpassword%2F

Disallow

/?q=user%2Fregister%2F

Disallow

/?q=user%2Flogin%2F

Disallow

/?q=user%2Flogout%2F

Disallow

/wp-admin/

Disallow

/administrator/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/images/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/logs/

Disallow

/media/

Disallow

/modules/

Disallow

/plugins/

Disallow

/templates/

Disallow

/tmp/

Disallow

/wp-includes/js

Disallow

/trackback

Disallow

/tag

Disallow

/category/*/*

Disallow

*/trackback

Disallow

/*?*

Disallow

/*?

Disallow

/*~*

Disallow

/*~

Other Records

Field

Value

crawl-delay

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)

spicyip.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

ahrefsbot

Other Records

scoutjet

Other Records

alexibot

aqua_products

asterias

b2w/0.1

backdoorbot/1.0

blowfish/1.0

bookmark search tool

botalot

botrighthere

builtbottough

bullseye/1.0

bunnyslippers

cheesebot

cherrypicker

cherrypickerelite/1.0

cherrypickerse/1.0

copernic

copyrightcheck

cosmos

crescent internet toolpak http ole control v.1.0

crescent

dittospyder

emailcollector

emailsiphon

emailwolf

erocrawler

extractorpro

fairad client

flaming attackbot

foobot

gaisbot

getright/4.2

harvest/1.5

hloader

httplib

httrack 3.0

humanlinks

infonavirobot

iron33/1.0.2

jennybot

kenjin spider

keyword density/0.9

larbin

lexibot

libweb/clshttp

linkextractorpro

linkscan/8.1a unix

linkwalker

lnspiderguy

lwp-trivial/1.34

lwp-trivial

mata hari

microsoft url control - 5.01.4511

microsoft url control - 6.00.8169

microsoft url control

miixpc/4.2

miixpc

mister pix

moget/2.1

moget

mozilla/4.0 (compatible; bullseye; windows 95)

msiecrawler

netants

nicerspro

offline explorer

openbot

openfind data gatherer

openfind

oracle ultra search

perman

propowerbot/2.14

spicyip.com
robots.txt