klimateibarat.com
robots.txt

Robots Exclusion Standard data for klimateibarat.com

Resource Scan

Scan Details

Site Domain klimateibarat.com
Base Domain klimateibarat.com
Scan Status Ok
Last Scan2026-03-03T00:45:10+00:00
Next Scan 2026-03-10T00:45:10+00:00

Last Scan

Scanned2026-03-03T00:45:10+00:00
URL https://klimateibarat.com/robots.txt
Domain IPs 104.21.47.95, 172.67.146.99, 2606:4700:3034::ac43:9263, 2606:4700:3037::6815:2f5f
Response IP 172.67.146.99
Found Yes
Hash 583efcdd55c055a3233d4e9970af8582aee27c215b8ad3f4278343101a1cc3e7
SimHash ef0edf7164a0

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

baiduspider

Rule Path
Allow /

yandexbot

Rule Path
Allow /
Disallow /admin/
Disallow /wp-admin/
Disallow /administrator/
Disallow /cpanel/
Disallow /phpmyadmin/
Disallow /tmp/
Disallow /temp/
Disallow /cache/
Disallow /logs/
Disallow /private/
Disallow /includes/
Disallow /config/
Allow /css/
Allow /js/
Allow /images/
Allow /img/
Allow /assets/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://klimateibarat.com/sitemap.xml
sitemap https://klimateibarat.com/sitemap.xml

Comments

  • Allow all search engines to crawl the site
  • Disallow crawling of admin areas (if any)
  • Disallow crawling of temporary files
  • Disallow crawling of private files
  • Allow crawling of important directories
  • Sitemap location
  • Crawl delay (optional - be respectful to server resources)
  • Additional sitemaps for different content types
  • Host directive (specify the preferred domain)

Warnings

  • `host` is not a known field.