thenewscommenter.com
robots.txt

Robots Exclusion Standard data for thenewscommenter.com

Resource Scan

Scan Details

Site Domain thenewscommenter.com
Base Domain thenewscommenter.com
Scan Status Ok
Last Scan2024-11-16T11:26:44+00:00
Next Scan 2024-11-23T11:26:44+00:00

Last Scan

Scanned2024-11-16T11:26:44+00:00
URL https://thenewscommenter.com/robots.txt
Redirect https://www.thegatewaypundit.com/robots.txt
Redirect Domain www.thegatewaypundit.com
Redirect Base thegatewaypundit.com
Domain IPs 104.26.14.78, 104.26.15.78, 172.67.70.111, 2606:4700:20::681a:e4e, 2606:4700:20::681a:f4e, 2606:4700:20::ac43:466f
Redirect IPs 104.22.4.85, 104.22.5.85, 172.67.41.88, 2606:4700:10::6816:455, 2606:4700:10::6816:555, 2606:4700:10::ac43:2958
Response IP 172.67.41.88
Found Yes
Hash 8de900925826761862da3a2064caed23f624588c5367d96feedbf4d47f5e0ccb
SimHash 78035802eca8

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

parler
parlerstaging
bingbot
msnbot
msnbot-media

Rule Path
Disallow /?s=
Disallow /?
Disallow /?*
Disallow /search/
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /members/
Disallow /admin_page/
Disallow /admin_page/*
Disallow /campaign/
Disallow /twitter/
Disallow /youtube/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 120

*

Rule Path
Disallow /*?p=*
Disallow /*%26p%3D*
Disallow /*?s=*
Disallow /*%26s%3D*
Disallow /*?ical=1
Disallow /*%26ical%3D1
Disallow /*?tribe-bar-date=*
Disallow /*%26tribe-bar-date%3D*
Disallow /?author=*
Disallow /*wp-comments*
Disallow /*wp-trackback*
Disallow /*wp-feed*
Disallow /*replytocom%3D*
Disallow /*?preview=*
Disallow /*%26preview%3D*
Disallow /*add-to-cart%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*cart/*
Disallow /*checkout/*
Disallow /*my-account/*
Disallow /*myaccount/*
Disallow /*?ajaxCalendar=1*
Allow /*/plugins/*

ahrefsbot

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thegatewaypundit.com/sitemap_index.xml
sitemap https://www.thegatewaypundit.com/news-sitemap.xml
sitemap https://www.thegatewaypundit.com/video-sitemap.xml

Comments

  • Start Robots Customizations
  • Stop bots from crawling junk URLs
  • End Robots Customizations