airccse.org
robots.txt

Robots Exclusion Standard data for airccse.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	airccse.org
Base Domain	airccse.org
Scan Status	Ok
Last Scan	2024-05-02T02:42:38+00:00
Next Scan	2024-06-01T02:42:38+00:00

Last Scan

Scanned	2024-05-02T02:42:38+00:00
URL	https://airccse.org/robots.txt
Domain IPs	69.167.168.245
Response IP	69.167.168.245
Found	Yes
Hash	e71863c3a790c4cd304bd73669b53b33ea1c7ba299251b0975b85de11c10406c
SimHash	01319051c6b6

Groups

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

slurp

Rule	Path
Allow	/

Rule

Path

Allow

openfind

Rule	Path
Allow	/

Rule

Path

Allow

scooter

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

twiceler

Rule	Path
Allow	/

Rule

Path

Allow

rogerbot

Rule	Path
Allow	/

Rule

Path

Allow

teoma

Rule	Path
Allow	/

Rule

Path

Allow

mantraagent

Rule	Path
Allow	/

Rule

Path

Allow

semanticscholarbot

Rule	Path
Allow	/

Rule

Path

Allow

lycos_spider_(t-rex)

Rule	Path
Allow	/

Rule

Path

Allow

robozilla

Rule	Path
Allow	/

Rule

Path

Allow

zyborg

Rule	Path
Allow	/

Rule

Path

Allow

ia_archiver

Rule	Path
Allow	/

Rule

Path

Allow

gulliver

Rule	Path
Allow	/

Rule

Path

Allow

echo2

Rule	Path
Allow	/

Rule

Path

Allow

scoutjet

Rule	Path
Allow	/

Rule

Path

Allow

yahoofeedseeker

Rule	Path
Allow	/

Rule

Path

Allow

bloglines

Rule	Path
Allow	/

Rule

Path

Allow

blogstreetbot

Rule	Path
Allow	/

Rule

Path

Allow

fastbuzz.com

Rule	Path
Allow	/

Rule

Path

Allow

syndic8

Rule	Path
Allow	/

Rule

Path

Allow

nif/1.1

Rule	Path
Allow	/

Rule

Path

Allow

newsgatoronline

Rule	Path
Allow	/

Rule

Path

Allow

mywireservicebot

Rule	Path
Allow	/

Rule

Path

Allow

feedster

Rule	Path
Allow	/

Rule

Path

Allow

feedfetcher

Rule	Path
Allow	/
Disallow	/sgw/
Disallow	/covers/
Disallow	/*checkval
Disallow	/*wicket%3Ainterface

Rule

Path

Allow

Disallow

/sgw/

Disallow

/covers/

Disallow

/*checkval

Disallow

/*wicket%3Ainterface

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

ezooms

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

Other Records

Field	Value
sitemap	http://airccse.org/sitemap.xml

Field

Value

sitemap

http://airccse.org/sitemap.xml

Comments

all others

airccse.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

mediapartners-google

adsbot-google

slurp

openfind

scooter

bingbot

twiceler

rogerbot

teoma

mantraagent

semanticscholarbot

lycos_spider_(t-rex)

robozilla

zyborg

ia_archiver

gulliver

echo2

scoutjet

yahoofeedseeker

bloglines

blogstreetbot

fastbuzz.com

syndic8

nif/1.1

newsgatoronline

mywireservicebot

feedster

feedfetcher

ahrefsbot

baiduspider

ezooms

mj12bot

yandexbot

*

Other Records

Other Records

Comments

airccse.org
robots.txt