ncrusz.pl
robots.txt

Robots Exclusion Standard data for ncrusz.pl

Resource Scan

Scan Details

Site Domain	ncrusz.pl
Base Domain	ncrusz.pl
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-08-16T00:02:32+00:00
Next Scan	2024-11-14T00:02:32+00:00

Last Successful Scan

Scanned	2023-06-28T16:27:08+00:00
URL	https://ncrusz.pl/robots.txt
Domain IPs	104.21.57.188, 172.67.165.119
Response IP	172.67.165.119
Found	Yes
Hash	b6d35e9ff87e74730b1be2a96fa12b03956ec8cb7a5e214e2238aca78153350a
SimHash	523c55406533

Groups

*

Rule	Path
Disallow	/wp-admin

acunetix

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

yandex

Rule	Path
Disallow	/

googlebot

Rule	Path
Allow	/

trendictionbot

Rule	Path
Disallow	/

applebot

Rule	Path
Disallow	/

grapeshot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

semrushbot-sa

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

pinterestbot

Rule	Path
Disallow	/

getintent crawler

Rule	Path
Disallow	/

bidswitchbot

Rule	Path
Disallow	/

baiduspider

Rule	Path
Disallow	/

linkdexbot

Rule	Path
Disallow	/

coccocbot-web

Rule	Path
Disallow	/

femtosearchbot

Rule	Path
Disallow	/

owler

Rule	Path
Disallow	/

tracemyfile

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

safednsbot

Rule	Path
Disallow	/

hybridbot

Rule	Path
Disallow	/

feedly

Rule	Path
Disallow	/

feedburner

Rule	Path
Disallow	/

boardreader

Rule	Path
Disallow	/

theoldreader.com

Rule	Path
Disallow	/

semantic-visions.com

Rule	Path
Disallow	/

proximic

Rule	Path
Disallow	/

weborama-fetcher

Rule	Path
Disallow	/

rogerbot

Rule	Path
Disallow	/

ias_crawler

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://ncrusz.pl/sitemap.xml
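
The groups and the sitemap record listed above can be checked programmatically. The following is a minimal sketch using Python's standard-library urllib.robotparser. The robots.txt text in it is a reconstruction of a few of the groups from this report, not the original file (whose exact text, ordering, and casing are not shown here), and the agent name "ExampleCrawler" is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the scanned file, built from the
# Groups and Other Records tables. Only four of the listed groups are included.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin

User-agent: googlebot
Allow: /

User-agent: ahrefsbot
Disallow: /

User-agent: ccbot
Disallow: /

Sitemap: https://ncrusz.pl/sitemap.xml
"""

BASE = "https://ncrusz.pl"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from memory, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(agent: str, path: str) -> bool:
    """Return whether `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleCrawler" is a hypothetical agent that falls back to the "*" group.
    for agent, path in [
        ("Googlebot", "/"),
        ("AhrefsBot", "/"),
        ("ExampleCrawler", "/wp-admin/"),
        ("ExampleCrawler", "/some-page"),
    ]:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

Note that urllib.robotparser matches agent names by case-insensitive substring and applies the first matching rule in a group, which may differ in edge cases from how individual crawlers interpret the same file.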

Comments

Interested in similar dооrways production? :)
Let's discuss our cooperation! Telegram: @DryBox
