ncseaa.edu
robots.txt

Robots Exclusion Standard data for ncseaa.edu

Resource Scan

Scan Details

Site Domain ncseaa.edu
Base Domain ncseaa.edu
Scan Status Ok
Last Scan2024-09-28T11:55:35+00:00
Next Scan 2024-10-28T11:55:35+00:00

Last Scan

Scanned2024-09-28T11:55:35+00:00
URL https://www.ncseaa.edu/robots.txt
Domain IPs 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Response IP 23.185.0.4
Found Yes
Hash bd2be041685271505a512da0fd6f8e62d281697ffa01f38c909ff72783180acc
SimHash 2abca894d2a5

Groups

timpibot
seekportbot

Rule Path
Disallow /

siteimprovebot-crawler

Rule Path
Disallow */wp-login.php
Disallow */wp-json/*
Disallow */wp-admin/*
Disallow */?attachment_id=*
Disallow */?s=*
Disallow */?taxonomy=nav_menu*
Disallow */?eventDisplay=past*
Disallow */?eventDisplay=photo*
Disallow */?post_type=tribe_events&eventDisplay=day*
Disallow */?post_type=tribe_events&eventDisplay=week*
Disallow */?post_type=tribe_events&eventDisplay=month*
Disallow */?tribe-bar-date=*
Disallow *%26eventDisplay%3Dpast*
Disallow *%26eventDisplay%3Dphoto*
Disallow *%26tribe-bar-date%3D*
Disallow */2009/*
Disallow */2010/*
Disallow */2011/*
Disallow */2012/*
Disallow */2013/*
Disallow */2014/*
Disallow */2015/*
Disallow */2016/*
Disallow */2017/*
Disallow */2018/*
Disallow */2019/*
Disallow */2020/*
Disallow */2021/*
Disallow */2022/*
Disallow */2023/*
Disallow */author/*
Disallow */category/*
Disallow */events/*
Disallow */organizer/*
Disallow */scripts/webalert.js?__ver=*
Disallow */tag/*
Disallow */venue/*

Other Records

Field Value
crawl-delay 3

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.ncseaa.edu/sitemap_index.xml

Comments

  • Blocking these bots
  • Site Improve blocking
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK