Robots Exclusion Standard data for tech-room.pl

Resource Scan

Scan Details

Site Domain tech-room.pl
Base Domain tech-room.pl
Scan Status Ok
Last Scan 2024-11-06T14:25:00+00:00
Next Scan 2024-11-13T14:25:00+00:00
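
The two timestamps above imply a rescan interval, which can be checked with a short sketch. It only assumes that the values are ISO 8601 strings, as printed; the variable names are illustrative.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-11-06T14:25:00+00:00")
next_scan = datetime.fromisoformat("2024-11-13T14:25:00+00:00")

# Difference between the scheduled and the completed scan.
interval = next_scan - last_scan
print(interval.days)
```

For this record the interval works out to seven days, which suggests (but does not prove) a weekly scan schedule.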

Last Scan

Scanned 2024-11-06T14:25:00+00:00
URL https://tech-room.pl/robots.txt
Domain IPs 109.125.199.36
Response IP 109.125.199.36
Found Yes
Hash 3eb7d2b5121b129f53f88400e937c70870fbe5e48f681b1220a57b9ff98d3285
SimHash 6b095a76e693
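
The Hash value is 64 hexadecimal digits long, which is the length of a SHA-256 digest. Assuming the scanner hashes the raw response body with SHA-256 (the report does not say so), a content fingerprint of that shape could be computed as sketched below. The `sample_body` bytes are a made-up stand-in, not the actual file, so the digest will not equal the one listed.

```python
import hashlib

def content_hash(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical stand-in for the fetched robots.txt bytes.
sample_body = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = content_hash(sample_body)
print(len(digest))  # 64 hex digits, the same length as the Hash field
```

An exact hash like this changes completely on any edit to the file. The shorter SimHash field is presumably a similarity fingerprint intended to stay close across small edits.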

Groups

googlebot-news

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Allow /wp-includes/*.css
Allow /wp-includes/*.js
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /cache/
Disallow /admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/
Disallow /*?*s=
Disallow /*%26s%3D
Disallow /category/*/page/
Disallow /tag/*/page/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /wp-content/backup/
Disallow /wp-content/uploads/wpo-wcpdf-*/
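
Several rules in this group overlap (for example `Disallow /wp-admin/` and `Allow /wp-admin/admin-ajax.php`), so the outcome depends on precedence. The sketch below assumes the longest-match convention described in RFC 9309: the matching rule with the longest path wins, and `Allow` wins a tie. The helper names `pattern_to_regex` and `is_allowed` are illustrative, and percent-encoding normalization is left out.

```python
import re

# Rules copied from the "*" group above, in listed order.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("allow", "/wp-includes/*.css"),
    ("allow", "/wp-includes/*.js"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("disallow", "/cache/"),
    ("disallow", "/admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("disallow", "/*?*s="),
    ("disallow", "/*%26s%3D"),
    ("disallow", "/category/*/page/"),
    ("disallow", "/tag/*/page/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "/wp-content/backup/"),
    ("disallow", "/wp-content/uploads/wpo-wcpdf-*/"),
]

def pattern_to_regex(path):
    """Compile a rule path: '*' matches any run of characters, a trailing '$' anchors the end."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(url_path, rules=RULES):
    """Apply longest-match precedence; a path that matches no rule is allowed."""
    best = None  # (rule path length, rule is an Allow)
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            # Longer path wins; on equal length, Allow (True) beats Disallow (False).
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]

for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
             "/wp-content/uploads/wpo-wcpdf-abc/invoice.pdf", "/?s=router"]:
    print(path, is_allowed(path))
```

Under this convention the listed order of the rules does not matter. The more specific `Allow /wp-admin/admin-ajax.php` overrides `Disallow /wp-admin/`, while `Disallow /wp-content/uploads/wpo-wcpdf-*/` overrides the shorter `Allow /wp-content/uploads/`.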

googlebot-image

Rule Path
Allow /wp-content/uploads/
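
A crawler obeys only one of the groups above: the one whose user-agent token matches its own, or `*` when none does. The sketch below assumes a simple case-insensitive exact match on the product token, which is a simplification of what individual crawlers do. `select_group` is a hypothetical helper name.

```python
# Group names as listed in this report.
GROUP_NAMES = ["googlebot-news", "adsbot-google", "*", "googlebot-image"]

def select_group(product_token, group_names=GROUP_NAMES):
    """Return the group a crawler should follow, falling back to '*'."""
    token = product_token.strip().lower()
    if token != "*" and token in group_names:
        return token
    return "*" if "*" in group_names else None

for bot in ["Googlebot-Image", "AdsBot-Google", "Googlebot", "bingbot"]:
    print(bot, "->", select_group(bot))
```

Because the selected group replaces the `*` group rather than adding to it, a crawler that matches `googlebot-image` is governed only by `Allow /wp-content/uploads/`, and none of the `Disallow` lines in the `*` group apply to it. There is no plain `googlebot` group in this file, so in this sketch that token falls back to `*`.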

Other Records

Field Value
sitemap https://tech-room.pl/sitemap_index.xml

Comments

  • Allow indexing of important resources
  • Block unnecessary URL parameters
  • Block pagination pages for categories and tags
  • Block login and registration pages
  • Block temporary files and backups
  • Instructions for specific bots
  • Sitemaps