transportator.md
robots.txt

Robots Exclusion Standard data for transportator.md

Resource Scan

Scan Details

Site Domain transportator.md
Base Domain transportator.md
Scan Status Ok
Last Scan 2026-02-24T10:22:54+00:00
Next Scan 2026-03-03T10:22:54+00:00

Last Scan

Scanned 2026-02-24T10:22:54+00:00
URL https://transportator.md/robots.txt
Domain IPs 89.42.218.13
Response IP 89.42.218.13
Found Yes
Hash 534b10652c29ad40736e735be5d60f23d46028c1f539b699cb0a1da433894773
SimHash 3d36db02e680
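
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a freshly fetched copy of the file could be compared against the recorded value like this (the function names are hypothetical):

```python
import hashlib

# Digest recorded in the scan report above.
RECORDED_HASH = "534b10652c29ad40736e735be5d60f23d46028c1f539b699cb0a1da433894773"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report hashes the raw response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the report."""
    return body_digest(body) == RECORDED_HASH
```

A mismatch would mean either that the file changed since the scan or that the assumption about the hashing scheme is wrong. The SimHash field is a separate similarity fingerprint and is not covered by this sketch.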

Groups

*

Rule Path
Allow /
Allow /assets/
Allow /images/
Allow /css/
Allow /js/
Allow /uploads/
Disallow /assets/inc/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /private/
Disallow /admin/
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /sendemail.php
Disallow /sendtelegram.php
Disallow /*?*sort=
Disallow /*?*session=
Disallow /*?*filter=
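
Because the group starts with `Allow /`, the outcome for any URL depends on how overlapping rules are resolved. The sketch below assumes the longest-match convention described in RFC 9309 (the matching pattern with the most characters wins, and Allow wins a tie) together with `*` wildcards and an optional trailing `$`. It illustrates that convention and is not the behavior of any specific crawler; `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names.

```python
import re

# Rules copied from the table above as (type, path) pairs.
RULES = [
    ("allow", "/"),
    ("allow", "/assets/"),
    ("allow", "/images/"),
    ("allow", "/css/"),
    ("allow", "/js/"),
    ("allow", "/uploads/"),
    ("disallow", "/assets/inc/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("disallow", "/private/"),
    ("disallow", "/admin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/sendemail.php"),
    ("disallow", "/sendtelegram.php"),
    ("disallow", "/*?*sort="),
    ("disallow", "/*?*session="),
    ("disallow", "/*?*filter="),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Apply longest-match precedence; no matching rule means allowed."""
    best_len, best_allow = -1, True
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            longer = len(path) > best_len
            tie_for_allow = len(path) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(path), allow
    return best_allow
```

Under these assumptions, `is_allowed("/assets/app.js")` is true, while `is_allowed("/assets/inc/db.php")` and `is_allowed("/catalog?page=2&sort=price")` are false. Python's standard `urllib.robotparser` uses the first matching rule and does not expand wildcards, so it would treat every path as allowed for this file because `Allow /` comes first.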

Other Records

Field Value
crawl-delay 3
sitemap https://transportator.md/sitemap.xml
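
The `crawl-delay 3` record asks crawlers that honor it to leave at least three seconds between requests. A minimal sketch of how a client might pace itself is shown below; `CrawlPacer` is a hypothetical helper, not part of any library, and the clock and sleep functions are injectable so the logic can be exercised without real waiting.

```python
import time


class CrawlPacer:
    """Spaces out requests by a fixed delay (hypothetical helper)."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self.clock = clock
        self.sleep = sleep
        self.last_request = None  # time of the previous request, if any

    def wait(self):
        """Block until `delay` seconds have passed since the previous call."""
        if self.last_request is not None:
            remaining = self.delay - (self.clock() - self.last_request)
            if remaining > 0:
                self.sleep(remaining)
        self.last_request = self.clock()


# Delay value taken from the crawl-delay record above.
pacer = CrawlPacer(delay_seconds=3)
```

Calling `pacer.wait()` before each request returns immediately the first time and afterwards sleeps only for whatever part of the three seconds has not yet elapsed.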

Comments

  • robots.txt for https://transportator.md
  • SEO-friendly and permissive configuration for full indexation
  • 1. Allow all crawlers to access public content
  • 2. Block access to sensitive or system directories
  • 3. Prevent indexing of URLs with session or filter parameters
  • 4. Optional crawl delay (ignored by Google, used by Bing/Yandex)
  • 5. Sitemap reference – essential for search engines
  • 6. Notes:
  • - This setup allows full indexation of all public pages and resources.
  • - Googlebot, Bingbot, and other major crawlers will have access to CSS, JS, and images.
  • - Backend PHP and admin directories remain protected.