termedia.pl
robots.txt

Robots Exclusion Standard data for termedia.pl

Resource Scan

Scan Details

Site Domain termedia.pl
Base Domain termedia.pl
Scan Status Ok
Last Scan 2024-11-07T09:17:37+00:00
Next Scan 2024-11-14T09:17:37+00:00

Last Scan

Scanned 2024-11-07T09:17:37+00:00
URL https://termedia.pl/robots.txt
Domain IPs 46.28.10.147
Response IP 46.28.10.147
Found Yes
Hash dacdc6bc87598befeebfe0cdafd5b996865ce2f75a8812d44317c83d7a38fc0e
SimHash 6314a8075735

Groups

*

Rule Path
Allow /f/
Allow /f/pages/
Disallow /portal-testowy/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
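
The group behaviour listed above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not a copy of the live file: the robots.txt text is reconstructed from the parsed groups in this report (the real file's ordering and grouping of bingbot and msnbot are assumptions), and ExampleBot is a hypothetical user agent standing in for any crawler matched by the * group. Python's parser applies the first matching rule rather than the longest match, which gives the same result for the paths tested here.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups listed in this report,
# not fetched from https://termedia.pl/robots.txt.
ROBOTS_TXT = """\
User-agent: *
Allow: /f/
Allow: /f/pages/
Disallow: /portal-testowy/

User-agent: bingbot
Crawl-delay: 1

User-agent: msnbot
Crawl-delay: 1
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of a URL."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://termedia.pl"

# ExampleBot is hypothetical; it falls through to the "*" group.
for agent in ("ExampleBot", "bingbot", "msnbot"):
    for path in ("/f/pages/", "/portal-testowy/"):
        allowed = parser.can_fetch(agent, BASE + path)
        print(f"{agent:10} {path:18} allowed={allowed}")
    print(f"{agent:10} crawl-delay={parser.crawl_delay(agent)}")
```

A crawler with its own group (bingbot, msnbot) uses only that group, so the Disallow for /portal-testowy/ in the * group does not apply to it. This is consistent with the "No rules defined. All paths allowed." entries above.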

Comments

  • Disallow: /f/f/
  • Disallow: /pobierz/
  • Disallow: /cms/
  • Disallow: /rs/
  • Disallow: /e/
  • Disallow: /f/posters/
  • Disallow: /f/proforma/
  • Disallow: /f/a_partners/
  • Disallow: /f/events_cert/
  • Disallow: /RestApi/
  • Disallow: /f/ads/
  • Disallow: /citation/
  • Disallow: /files/
  • Disallow: /pobierz/
  • Disallow: /*adFormat=*