ny1.com
robots.txt

Robots Exclusion Standard data for ny1.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ny1.com
Base Domain	ny1.com
Scan Status	Ok
Last Scan	2024-11-02T09:11:56+00:00
Next Scan	2024-11-09T09:11:56+00:00

Last Scan

Scanned	2024-11-02T09:11:56+00:00
URL	https://ny1.com/robots.txt
Domain IPs	23.20.208.149, 34.192.179.49, 44.196.53.233, 52.203.134.212
Response IP	23.20.208.149
Found	Yes
Hash	e1f2d90839ae86bd244469aacab98bb518b2172c20aaa6e3c03b7b23f75301ab
SimHash	6a0e8973fa97

Groups

*

Rule	Path
Allow	/$
Allow	/nyc/*
Allow	/sitemap.xml
Allow	/services/*
Allow	/etc/*
Allow	/content/*
Allow	/.well-known/assetlinks.json
Disallow	/nyc/all-boroughs/app-headlines.html
Disallow	queens.ny1.com.html
Disallow	/*
Disallow	/nyc/noticias
Disallow	/nyc/noticias/*
Disallow	/nyc/all-boroughs/app-headlines
Disallow	/nyc/all-boroughs/app-headlines*
Disallow	///partner-content/*
Disallow	/911-videos/*
Disallow	/content/news/stories/*

Rule

Path

Allow

/$

Allow

/nyc/*

Allow

/sitemap.xml

Allow

/services/*

Allow

/etc/*

Allow

/content/*

Allow

/.well-known/assetlinks.json

Disallow

/nyc/all-boroughs/app-headlines.html

Disallow

queens.ny1.com.html

Disallow

/*

Disallow

/nyc/noticias

Disallow

/nyc/noticias/*

Disallow

/nyc/all-boroughs/app-headlines

Disallow

/nyc/all-boroughs/app-headlines*

Disallow

/*/*/partner-content/*

Disallow

/911-videos/*

Disallow

/content/news/stories/*

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

1

twitterbot

Rule	Path
Disallow	/.well-known/

Rule

Path

Disallow

/.well-known/

Back to top

Other Records

Field	Value
sitemap	https://ny1.com/sitemap.xml

Field

Value

sitemap

https://ny1.com/sitemap.xml

Back to top

Comments

Allowed Paths
Excluded Pages
Excluded Paths
Additional Config

Back to top

ny1.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

twitterbot

Other Records

Comments

ny1.com
robots.txt