promoscuolaweb.it
robots.txt

Robots Exclusion Standard data for promoscuolaweb.it

Archived Snapshots

Resource Scan

Scan Details

Site Domain	promoscuolaweb.it
Base Domain	promoscuolaweb.it
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-10-28T08:35:09+00:00
Next Scan	2025-01-26T08:35:09+00:00

Last Successful Scan

Scanned	2023-06-14T08:30:19+00:00
URL	https://promoscuolaweb.it/robots.txt
Domain IPs	104.21.73.152, 172.67.190.84, 2606:4700:3031::6815:4998, 2606:4700:3031::ac43:be54
Response IP	172.67.190.84
Found	Yes
Hash	2d944c6e691fa1409c2c3ddc26f588afdc78d6d72229245071e30426007cccab
SimHash	523c4f406513

Groups

*

Rule	Path
Disallow	/wp-admin

Rule

Path

Disallow

/wp-admin

acunetix

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

trendictionbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

pinterestbot

Rule	Path
Disallow	/

Rule

Path

Disallow

getintent crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

bidswitchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

coccocbot-web

Rule	Path
Disallow	/

Rule

Path

Disallow

femtosearchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

owler

Rule	Path
Disallow	/

Rule

Path

Disallow

tracemyfile

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

safednsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

hybridbot

Rule	Path
Disallow	/

Rule

Path

Disallow

feedly

Rule	Path
Disallow	/

Rule

Path

Disallow

feedburner

Rule	Path
Disallow	/

Rule

Path

Disallow

boardreader

Rule	Path
Disallow	/

Rule

Path

Disallow

theoldreader.com

Rule	Path
Disallow	/

Rule

Path

Disallow

semantic-visions.com

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

weborama-fetcher

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ias_crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://promoscuolaweb.it/sitemap.xml

Field

Value

sitemap

https://promoscuolaweb.it/sitemap.xml

Comments

Interested in similar dооrways production? :)
Let's discuss our cooperation! Telegram: @DryBox

promoscuolaweb.itrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

acunetix

ahrefsbot

yandex

googlebot

trendictionbot

applebot

grapeshot

semrushbot

semrushbot-sa

mj12bot

dotbot

petalbot

pinterestbot

getintent crawler

bidswitchbot

baiduspider

linkdexbot

coccocbot-web

femtosearchbot

owler

tracemyfile

ccbot

safednsbot

hybridbot

feedly

feedburner

boardreader

theoldreader.com

semantic-visions.com

proximic

weborama-fetcher

rogerbot

ias_crawler

Other Records

Comments

promoscuolaweb.it
robots.txt