knewtoday.net
robots.txt

Robots Exclusion Standard data for knewtoday.net

Resource Scan

Scan Details

Site Domain knewtoday.net
Base Domain knewtoday.net
Scan Status Ok
Last Scan 2024-11-05T20:43:17+00:00
Next Scan 2024-11-12T20:43:17+00:00

Last Scan

Scanned 2024-11-05T20:43:17+00:00
URL https://knewtoday.net/robots.txt
Domain IPs 2a02:4780:39:353f:a80f:3ace:1090:f108, 84.32.84.9
Response IP 93.127.187.137
Found Yes
Hash def7c7127d867e46cd9b58f97f9b593657af26621594a95c5271556e37b5285d
SimHash 66a5d343493a
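
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256, a minimal sketch of how a content hash could be used to detect changes between scans follows. The function names and the sample body are hypothetical and are not taken from the scanner.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash.lower()


# Hypothetical sample body, not the real file served by knewtoday.net.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = fingerprint(sample)
```

An exact hash only signals that some byte changed. The SimHash field presumably serves the complementary purpose of spotting near-identical files, but its exact scheme is not described in the report.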

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow */disclaimer/*
Disallow *?attachment_id=
Disallow /privacy-policy
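
The rules in this group mix Allow and Disallow lines with `*` wildcards, so the outcome for a given path depends on how a crawler resolves overlaps. The sketch below assumes the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). `pattern_to_regex` and `is_allowed` are hypothetical helper names, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group above, as (directive, pattern) pairs.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*/*.css"),
    ("allow", "/*/*.js"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "*/disclaimer/*"),
    ("disallow", "*?attachment_id="),
    ("disallow", "/privacy-policy"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under this assumption, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than `/wp-admin/`, while `is_allowed("/wp-includes/js/jquery.js")` is false because `/wp-includes/` is longer than `/*/*.js`.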

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /wp-content/uploads/

applebot

Rule Path
Allow /

yandex

Rule Path
Allow /

yandeximages

Rule Path
Allow /wp-content/uploads/

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

qwantify

Rule Path
Allow /

*

Rule Path
Allow /ads.txt

*

Rule Path
Allow /app-ads.txt
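
The file contains three separate `*` groups plus one group per named crawler. Under RFC 9309, groups that name the same user agent are combined, and a crawler with its own group uses that group instead of `*`. The sketch below illustrates that selection with an abbreviated subset of the groups listed above. `rules_for` is a hypothetical helper, and exact matching on the lowercased product token is a simplification.

```python
# Abbreviated subset of the groups in this report, in file order.
GROUPS = [
    ("*", [("disallow", "/wp-admin/"), ("disallow", "/privacy-policy")]),
    ("googlebot", [("allow", "/")]),
    ("googlebot-image", [("allow", "/wp-content/uploads/")]),
    ("*", [("allow", "/ads.txt")]),
    ("*", [("allow", "/app-ads.txt")]),
]


def rules_for(product_token, groups=GROUPS):
    """Return the rules that apply to a crawler.

    Groups naming the same agent are merged; a crawler without its own
    group falls back to the merged "*" group.
    """
    merged = {}
    for agent, rules in groups:
        merged.setdefault(agent.lower(), []).extend(rules)
    return merged.get(product_token.lower(), merged.get("*", []))
```

With this selection, `rules_for("Googlebot")` returns only `Allow /`, so the Disallow lines in the `*` groups would not apply to that crawler, whereas an unlisted crawler receives the merged `*` rules.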

Other Records

Field Value
crawl-delay 5

Comments

  • Allow/Disallow Ads.txt
  • Allow/Disallow App-ads.txt
  • robots.txt file for YouTube
  • Created in the distant future (the year 2000) after
  • the robotic uprising of the mid 90's which wiped out all humans.
  • User-agent: Mediapartners-Google*
  • Disallow:
  • User-agent: *
  • Disallow: /comment
  • Disallow: /feeds/videos.xml
  • Disallow: /get_video
  • Disallow: /get_video_info
  • Disallow: /get_midroll_info
  • Disallow: /live_chat
  • Disallow: /login
  • Disallow: /qr
  • Disallow: /results
  • Disallow: /signup
  • Disallow: /t/terms
  • Disallow: /timedtext_video
  • Disallow: /verify_age
  • Disallow: /watch_ajax
  • Disallow: /watch_fragments_ajax
  • Disallow: /watch_popup
  • Disallow: /watch_queue_ajax
  • Sitemap: https://www.youtube.com/sitemaps/sitemap.xml
  • Sitemap: https://www.youtube.com/product/sitemap.xml
  • This robots.txt file was created by Better Robots.txt (Index & Rank Booster by Pagup) Plugin. https://www.better-robots.com/