akhbarak.net
robots.txt

Robots Exclusion Standard data for akhbarak.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	akhbarak.net
Base Domain	akhbarak.net
Scan Status	Ok
Last Scan	2024-09-22T19:11:59+00:00
Next Scan	2024-09-29T19:11:59+00:00

Last Scan

Scanned	2024-09-22T19:11:59+00:00
URL	https://akhbarak.net/robots.txt
Domain IPs	104.21.71.199, 172.67.171.157, 2606:4700:3034::ac43:ab9d, 2606:4700:3037::6815:47c7
Response IP	172.67.171.157
Found	Yes
Hash	e5d5d896cd130e1a2cd8efed0b218ca3447e3e12ee07e350b0dba5eca75b7721
SimHash	ab891e1de5d1

Groups

*

Rule	Path
Disallow	/admin/
Disallow	/articles/
Disallow	/topics/limited/
Disallow	/tags/limited/
Disallow	/clusters/limited/
Disallow	//sort/
Disallow	/*/by_date
Disallow	/*/by_score
Disallow	/*/by_date/
Disallow	/*/by_score/
Disallow	/blog/tag/*
Disallow	/blog/author/*

Rule

Path

Disallow

/admin/

Disallow

/articles/

Disallow

/topics/limited/

Disallow

/tags/limited/

Disallow

/clusters/limited/

Disallow

/*/sort/*

Disallow

/*/by_date

Disallow

/*/by_score

Disallow

/*/by_date/

Disallow

/*/by_score/

Disallow

/blog/tag/*

Disallow

/blog/author/*

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

5

Back to top

Other Records

Field	Value
sitemap	https://akhbarak.net/sitemaps/static.xml
sitemap	https://akhbarak.net/sitemaps/tags.xml
sitemap	https://akhbarak.net/sitemaps/sections.xml
sitemap	https://akhbarak.net/sitemaps/sources.xml
sitemap	https://akhbarak.net/videos.xml
sitemap	https://akhbarak.net/galleries.xml

Field

Value

sitemap

https://akhbarak.net/sitemaps/static.xml

sitemap

https://akhbarak.net/sitemaps/tags.xml

sitemap

https://akhbarak.net/sitemaps/sections.xml

sitemap

https://akhbarak.net/sitemaps/sources.xml

sitemap

https://akhbarak.net/videos.xml

sitemap

https://akhbarak.net/galleries.xml

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
Disallow: /*.js$
User-Agent: *
Disallow: /

Back to top

akhbarak.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Other Records

Comments

akhbarak.net
robots.txt