Update docs and tag v1.3.0.

parent 3942e276
......@@ -17,9 +17,6 @@
[submodule "boxomon"]
path = boxomon
url = git@gitlab.palpant.us:justin/boxomon.git
[submodule "k8s-node-termination-handler"]
path = k8s-node-termination-handler
url = git@github.com:GoogleCloudPlatform/k8s-node-termination-handler.git
[submodule "justinpalpant-website"]
path = justinpalpant-website
url = git@gitlab.palpant.us:justin/justinpalpant-website.git
......@@ -32,3 +29,12 @@
[submodule "backup-to-dropbox"]
path = backup-to-dropbox
url = git@gitlab.palpant.us:justin/rclone-to-dropbox.git
[submodule "postgres-exporter"]
path = postgres-exporter
url = git@gitlab.palpant.us:justin/postgres-exporter.git
[submodule "gitlab-ci-pipelines-exporter"]
path = gitlab-ci-pipelines-exporter
url = git@gitlab.palpant.us:justin/gitlab-ci-pipelines-exporter.git
[submodule "gke-preemptible-killer"]
path = gke-preemptible-killer
url = git@gitlab.palpant.us:justin/gke-preemptible-killer.git
......@@ -5,35 +5,39 @@
- [Changelog](#changelog)
- [Unreleased](#unreleased)
- [Added](#added)
- [Changes](#changes)
- [Changed](#changed)
- [Removed](#removed)
- [1.2.0 - 2019-11-13](#120-2019-11-13)
- [1.3.0 - 2020-01-04](#130-2020-01-04)
- [Added](#added-1)
- [Changes](#changes-1)
- [Changes](#changes)
- [Removed](#removed-1)
- [1.1.1 - 2019-09-24](#111-2019-09-24)
- [1.2.0 - 2019-11-13](#120-2019-11-13)
- [Added](#added-2)
- [Changes](#changes-2)
- [Changes](#changes-1)
- [Removed](#removed-2)
- [1.1.0 - 2019-08-23](#110-2019-08-23)
- [1.1.1 - 2019-09-24](#111-2019-09-24)
- [Added](#added-3)
- [Changes](#changes-3)
- [Changes](#changes-2)
- [Removed](#removed-3)
- [1.0.0 - 2019-07-01](#100-2019-07-01)
- [1.1.0 - 2019-08-23](#110-2019-08-23)
- [Added](#added-4)
- [Changes](#changes-4)
- [Changes](#changes-3)
- [Removed](#removed-4)
- [0.2.0 - 2018-03-05](#020-2018-03-05)
- [1.0.0 - 2019-07-01](#100-2019-07-01)
- [Added](#added-5)
- [Changes](#changes-5)
- [Changes](#changes-4)
- [Removed](#removed-5)
- [0.1.1 - 2017-12-05](#011-2017-12-05)
- [0.2.0 - 2018-03-05](#020-2018-03-05)
- [Added](#added-6)
- [Changes](#changes-5)
- [Removed](#removed-6)
- [0.1.1 - 2017-12-05](#011-2017-12-05)
- [Added](#added-7)
- [Changes](#changes-6)
- [0.1.0 - 2017-11-25](#010-2017-11-25)
- [Added](#added-7)
- [Added](#added-8)
- [Changes](#changes-7)
- [Removed](#removed-6)
- [Removed](#removed-7)
<!-- END doctoc generated TOC please keep comment here to allow auto update -->
......@@ -45,8 +49,34 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.
## [Unreleased]
### Added
### Changed
### Removed
## [1.3.0] - 2020-01-04
### Added
- Deployed [gke-preemptible-killer](https://gitlab.palpant.us/justin/gke-preemptible-killer) to kill preemptible nodes unpredictably and at half their lifespan. While this increases the total number of nodes that get recycled, it makes it less likely that nodes will be preempted in a large group, which reduces the risk of a severe availability hit.
- Deployed [gitlab-ci-pipelines-exporter](https://gitlab.palpant.us/justin/gitlab-ci-pipelines-exporter) to expose Gitlab CI metrics to Prometheus.
- Deployed [postgres-exporter](https://gitlab.palpant.us/justin/postgres-exporter) to expose PSQL metrics to Prometheus
- Deployed [Thanos](https://github.com/thanos-io/thanos), a solution for long-term Prometheus storage, to back up Prometheus data to GCS and serve queries out of that long-term storage.
- [Thanos query frontend](https://thanos.palpant.us)
- [Querier](https://gitlab.palpant.us/justin/palpantlab-gitlab/blob/master/deploy/kubectl-apply/gke/thanos-querier.yaml) and a stateless Deployment
- [Sidecar](https://gitlab.palpant.us/justin/palpantlab-gitlab/blob/master/deploy/helm-upgrade/values.yaml#L266) as part of the Prometheus helm chart
- [Store](https://gitlab.palpant.us/justin/palpantlab-gitlab/blob/master/deploy/kubectl-apply/gke/thanos-store.yaml) as a StatefulSet with local ephemeral storage (but no persistent disks, since the local disk essentially serves as a glorified cache and should be easy to wipe.
- [Compactor](https://gitlab.palpant.us/justin/palpantlab-gitlab/blob/master/deploy/kubectl-apply/gke/thanos-compact.yaml) as a CronJob that runs daily on the batch-compute nodes
- Added a batch-compute nodes (node pools `pool-e2-batch-8cpu-megadisk` and `pool-e2-batch-4cpu-megadisk`) for jobs which require high CPU, high memory, or significant local ephemeral disk space.
- Significantly improved the usability of [Grafana](https://gitlab.palpant.us/grafana):
- Added persistence via PSQL and got rid of dashboard-by-configmap paradigm
- Added LDAP, Google OAuth, and GitLab OAuth as login methods instead of having just a single root account.
### Changes
- Changed rclone to Dropbox frequency from 1/day to 1/week for backups, Prometheus, LFS and GCR files to better identify which of these costs the most to export (answer: Prometheus).
- Lots of resource allocation changes - now that Prometheus is tracking CPU and memory usage, we can set tighter limits and pack our nodes tighter.
- Change the pool of stable notes from n1-standard-1 to n1-highmem-1
- Changed the base preemptible node pool type from n1-standard-2 to e2-standard-2 for a free 500Mi of RAM per node, and added a 100Gi root disk to allow for some ephemeral-storage allocation
- Made alertmanager a statefulset with 3 replicas, and added a [frontend](https://alertmanager.palpant.us).
- Upgrade GitLab to v12.6.2, chart v2.6.2.
- Upgrade oauth2_proxy to use v4.0.0 now that changes have been merged upstream.
### Removed
- Removed the k8s-node-termination-handler in favor of the gke-preemptible-killer
## [1.2.0] - 2019-11-13
### Added
......@@ -193,7 +223,8 @@ Lastly, I have split up the single mono-repo into individual repos to support si
- HAProxy, all instances
- Most of the 9s in my previous uptime. But they will be back, and better than ever!
[Unreleased]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.2.0...HEAD
[Unreleased]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.3.0...HEAD
[1.2.0]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.2.0...v1.3.0
[1.2.0]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.1.1...v1.2.0
[1.1.1]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.1.0...v1.1.1
[1.1.0]: https://gitlab.palpant.us/justin/palpantlab-infra/compare/v1.0.0...v1.1.0
......
Subproject commit d08da83dcb595c987c2256aa23046fbc40a35d14
Subproject commit 7e328aef45e6388a6576b702a2c246f97077bc83
Subproject commit 8187ec8378373b174957959f1614eec0374a57dc
Subproject commit bef9bc95777120ac45a9c483f479d1f0075a1595
Subproject commit b5670d28572f6ae1363c4ca8dd38dea5afb804f0
Subproject commit ccedcae4a9172a8bcf84b93433ba76acf1e667ea
Subproject commit 6f478c7ee9370962f291a6b03d12e202416844e8
Subproject commit fda79b7d8f433c2f5ee13d9a78c71c1c9100d64f
Subproject commit 699a5941b8bbd3dadd0836ddb3105a3975ffa9e5
Subproject commit 35a28f7445cfc1d5e53c7f3c747583fa22cff06e
Subproject commit 61f08a1a4441b2cc92b6831fefd4df6ae31e922b
Subproject commit 8bdbfb020f28bd2d6ea39b99f881caeb36162504
Subproject commit 6a77752bcc7f777d162e288a6c1708aecd9fe7d1
Subproject commit 8009c67658fa7548d9c4cae2bf5fecabbbedc541
Subproject commit 8873debf630ab5d1da9f8e6490b68d20e7315a28
Subproject commit 1eab8441885315b03289436ef9d140fdb0bee58a
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment