am1jx.terre-islam.fr
robots.txt

Robots Exclusion Standard data for am1jx.terre-islam.fr

Archived Snapshots

Resource Scan

Scan Details

Site Domain	am1jx.terre-islam.fr
Base Domain	terre-islam.fr
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-07-30T08:34:56+00:00
Next Scan	2024-10-28T08:34:56+00:00

Last Successful Scan

Scanned	2023-06-14T08:30:19+00:00
URL	https://am1jx.terre-islam.fr/robots.txt
Domain IPs	104.21.68.240, 172.67.200.69, 2606:4700:3033::6815:44f0, 2606:4700:3037::ac43:c845
Response IP	104.21.68.240
Found	Yes
Hash	9a237ff7bd39ee9f69779d77caf4af88be799885614e7e7037ec1efdc7045b02
SimHash	423c43406733

Groups

*

Rule	Path
Disallow	/wp-admin

Rule

Path

Disallow

/wp-admin

acunetix

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

trendictionbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

pinterestbot

Rule	Path
Disallow	/

Rule

Path

Disallow

getintent crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

bidswitchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

coccocbot-web

Rule	Path
Disallow	/

Rule

Path

Disallow

femtosearchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

owler

Rule	Path
Disallow	/

Rule

Path

Disallow

tracemyfile

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

safednsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

hybridbot

Rule	Path
Disallow	/

Rule

Path

Disallow

feedly

Rule	Path
Disallow	/

Rule

Path

Disallow

feedburner

Rule	Path
Disallow	/

Rule

Path

Disallow

boardreader

Rule	Path
Disallow	/

Rule

Path

Disallow

theoldreader.com

Rule	Path
Disallow	/

Rule

Path

Disallow

semantic-visions.com

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

weborama-fetcher

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ias_crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://am1jx.terre-islam.fr/sitemap.xml

Field

Value

sitemap

https://am1jx.terre-islam.fr/sitemap.xml

Comments

Interested in similar dооrways production? :)
Let's discuss our cooperation! Telegram: @DryBox

am1jx.terre-islam.frrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

acunetix

ahrefsbot

yandex

googlebot

trendictionbot

applebot

grapeshot

semrushbot

semrushbot-sa

mj12bot

dotbot

petalbot

pinterestbot

getintent crawler

bidswitchbot

baiduspider

linkdexbot

coccocbot-web

femtosearchbot

owler

tracemyfile

ccbot

safednsbot

hybridbot

feedly

feedburner

boardreader

theoldreader.com

semantic-visions.com

proximic

weborama-fetcher

rogerbot

ias_crawler

Other Records

Comments

am1jx.terre-islam.fr
robots.txt