5terka.com
robots.txt

Robots Exclusion Standard data for 5terka.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	5terka.com
Base Domain	5terka.com
Scan Status	Ok
Last Scan	2024-06-03T17:44:08+00:00
Next Scan	2024-06-10T17:44:08+00:00

Last Scan

Scanned	2024-06-03T17:44:08+00:00
URL	https://5terka.com/robots.txt
Domain IPs	45.128.206.165
Response IP	45.128.206.165
Found	Yes
Hash	9a76be31fee23854efe52231728903d540c499e04b93739f62aae52421f610b4
SimHash	bc121d0bc574

Groups

*

Rule	Path
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/profit
Disallow	/examer
Disallow	/studlance
Disallow	/war
Disallow	/konkurs
Disallow	/geekbrains
Disallow	/repetitor

Rule

Path

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/profit

Disallow

/examer

Disallow

/studlance

Disallow

/war

Disallow

/konkurs

Disallow

/geekbrains

Disallow

/repetitor

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Directories

Back to top

Warnings

`host` is not a known field.

Back to top

5terka.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Comments

Warnings

5terka.com
robots.txt