/.well-known/

khabarfarsi.com
robots.txt

Robots Exclusion Standard data for khabarfarsi.com

Resource Scan

Scan Details

Site Domain	khabarfarsi.com
Base Domain	khabarfarsi.com
Scan Status	Ok
Last Scan	5/11/2025, 8:52:50 AM
Next Scan	6/10/2025, 8:52:50 AM

Last Scan

Scanned	5/11/2025, 8:52:50 AM
URL	https://khabarfarsi.com/robots.txt
Domain IPs	104.21.79.144, 172.67.146.70, 2606:4700:3036::6815:4f90, 2606:4700:3036::ac43:9246
Response IP	172.67.146.70
Found	Yes
Hash	03b2b82baa2af2749dac94ecda6fea8b82760b93cad4263ca6f6217662f64334
SimHash	3894190acf54
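A content hash like the one recorded above is typically used to tell whether a file changed between two scans. The following is a minimal sketch of that comparison. It assumes the 64-hex-digit Hash value is a SHA-256 digest of the raw response body, which the report does not state; the function names `body_fingerprint` and `has_changed` are hypothetical. The SimHash value is not reproduced here because the report does not describe how it is computed.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hex: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_fingerprint(body) != previous_hex.strip().lower()
```

A scanner could store the digest from one scan and call `has_changed` with the body fetched at the next scheduled scan to decide whether the rules need to be re-parsed.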

Groups

User-agent	*

Rule	Path
Disallow	/se_load_more_catnews
Disallow	/?q=se_load_more_catnews
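The two Disallow rules for the `*` group can be checked against concrete URLs with Python's standard `urllib.robotparser` module. This is a sketch under stated assumptions: the rule text is re-typed from the table rather than fetched from the site, and the crawler name `ExampleBot` and the helper `build_parser` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules re-typed from the scan report (not fetched live).
RULES = """\
User-agent: *
Disallow: /se_load_more_catnews
Disallow: /?q=se_load_more_catnews
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RULES)
BASE = "https://khabarfarsi.com"

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
blocked_path = parser.can_fetch("ExampleBot", BASE + "/se_load_more_catnews")
blocked_query = parser.can_fetch("ExampleBot", BASE + "/?q=se_load_more_catnews")
allowed_home = parser.can_fetch("ExampleBot", BASE + "/")
```

Both rules are prefix matches, so under this parser any URL that starts with either path is also treated as disallowed, while the home page stays fetchable.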

Comments

robots.txt

This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.

This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt

For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html

For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

Files
Paths (clean URLs)
Paths (no clean URLs)
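The comments above note that robots.txt is only honored at the root of a host. A minimal sketch of how a crawler might derive that location from any page URL is shown here; the function name `robots_url` is hypothetical, and the example URLs are the ones used in the comments.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace the path and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def is_root_robots(url: str) -> bool:
    """True only when url points at robots.txt directly under the host root."""
    return urlsplit(url).path == "/robots.txt"
```

With these helpers, `http://example.com/site/robots.txt` is recognized as a location crawlers would ignore, and any page under `http://example.com/site/` maps back to `http://example.com/robots.txt`.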

Warnings

27 invalid lines.