Skip to content

0.14.0

Compare
Choose a tag to compare
@determined-ci determined-ci released this 05 Feb 21:39
· 6405 commits to main since this release

Changelog

9ee2fa4 chore: bump version: 0.14.0rc4 -> 0.14.0
e7da518 docs: Minor edits to release notes for 0.14.0.
31c3ad4 edit
b09f452 edit
3f57533 Tweak release notes.
90cdf6e Tweaks for release notes.
def0d9b docs: Release notes for 0.14.0.
82ffa7e chore: bump version: 0.14.0rc3 -> 0.14.0rc4
55a3c22 chore: revert default images and framework versions (#1936)
5c9081b chore: bump version: 0.14.0rc2 -> 0.14.0rc3
94820c3 docs: edit model debug doc (#1925)
ef702dc fix: correct the comparison function when numbers are fractions [DET-4969] (#1924)
8369e11 refactor: paginate experiment trials [DET-4900, DET-4921, DET-4922] (#1892)
017764f fix: correct cancel confirm button label to confirm [DET-4966] (#1922)
77740fa fix: buffer the trial log in the correct order [DET-4931] (#1912)
1ddb99b chore: bump version: 0.14.0rc1 -> 0.14.0rc2
21c704d chore: improve resource pool details presentation [DET-4968] (#1926)
836414b fix: remove clickable style from trial info table [DET-4967] (#1923)
b03a114 fix: add default query limit and add missing sort by state [DET-4919] (#1921)
fd94f24 docs: fix incorrect reference in docs (#1919)
7a1f051 fix: typos in model debugging doc (#1918)
2b0031e fix: fix a utilization calculation error in hgi resource bar for cpu slots [DET-4913] (#1911)
5de2710 docs: clean up resource pool docs and add release notes (#1917)
4f6b6ba feat: support resource pools in det-deploy local agent-up [DET-4938] (#1906)
6c3150b refactor: update active experiments [DET-4915] (#1910)
4a86995 chore: bump version: 0.14.0rc0 -> 0.14.0rc1
024b9fa fix: correct best and latest metric sort by params for the GET experiment trials API (#1915) [DET-4920]
733dca8 fix: add non-scalar metric expectation to protobufs [DET-4893] [DET-4911] (#1876)
fb4e119 feat: support more fields to sortBy in /api/v1/experiments/trials [DET-4219, DET-4920] (#1899)
a152fd3 chore: Bump images and versions to Tensorflow 2.4.1 (#1913)
099d5b2 chore: let CLI verify the master using combined system/custom certs (#1859) [DET-4666]
e25c054 docs: add model debug doc (#1895)
c0dd89e chore: reword resource pool ui presentation [DET-4925] (#1898)
6f84f76 chore: bump version: 0.14.0.dev0 -> 0.14.0rc0
10af5ee chore: bump version: 0.13.14 -> 0.14.0.dev0
e933032 chore: bump version: 0.13.14rc0 -> 0.13.14
fe7973b chore: bump version: 0.13.14.dev0 -> 0.13.14rc0
22be8c5 revert: "Revert "fix: migrate trial log ID to bigint (#1792)" (#1901)" (#1902)
1e0948d Revert "fix: migrate trial log ID to bigint (#1792)" (#1901)
4a262bc chore: save user preference for cluster view [DET-4926] (#1896)
63abc66 fix: migrate trial log ID to bigint (#1792)
142b9d1 feat: show resource pools without connected agents [DET-4924] (#1897)
6987f34 chore: move task messages to sproto (#1891)
24a12bc docs: add topic guide for commands and shells (#1886) [DET-4901]
c353fa1 feat: Documentation and CI of support for NVIDIA A100's and Google A2 instances (#1888)
fa85d33 chore: fix type errors in IPC code (#1885)
058ba7d docs: improve resource pool docs (#1865)
702add8 chore: update trials API name (#1873)
e434619 fix: render resource pools in order (#1883)
55885bc chore: Upgrading environment and dependencies to PyTorch 1.7 and TensorFlow 2.4 (#1851)
ea9abae fix: don't lose logs of short-lived commands (#1882) [DET-4907]
2b8a99c fix: trial hangs when it fails to write to the DB (#1877)
d0b88fe fix: allow NULL trials.request_id for backwards compat (#1881)
434094b ci: tolerate longer time for concurrent log uploading [DET-4908] (#1879)
facd721 ci: fix kubernetes configuration resolution [DET-4909] (#1880)
fbe48c8 fix: CI failures caused by resource pool merge (#1864)
9a0d051 perf: move experiment API filtering and pagination to database [DET-4770] (#1803)
6be4a49 fix: altered tf.config function call to be compatible with tf 1.15 [DET-4852] (#1836)
e61415f build: fix for make check-schemas on non-GNU build machines (#1871)
d4a6a9a chore: update CLI commands to display resource pool [DET-4677] (#1709)
85267b5 docs: clarify preemptible instance doc for static and dynamic agents (#1870)
811b7dc chore: expose det-deploy AWS profile support [DET-4891] (#1868)
4bc23a9 chore: restart in-progress HP importance computation on master restart [DET-4675] (#1844)
d114d46 chore: remove the deprecated PyTorch API [DET-3262] (#1784)
d97735f fix: allow larger gRPC response bodies (#1869)
f987602 feat: enable new hgi-aware cluster page [DET-4854] (#1855)
7d4f248 build: avoid re-downloading codegen binary (#1838)
916be66 fix: handle learning curve edge cases [DET-4832] (#1827)
3810fc5 ci: check that Go dependencies are tidied (#1852)
93332bf test: print start time for each E2E test (#1867)
b7721a4 fix: order migrations in the order they landed (#1866)
71aa544 expand details in resource pool modal [DET-4884] (#1857)
54fd6fc feat: add resource pools (#1846)
05266cd docs: Release notes for 0.13.13. (#1843)
e657379 chore: bump version: 0.13.13.dev0 -> 0.13.14.dev0
5e88f2b feat: swap master restart to be snapshot based (#1745) [DET-816]
0db30c6 fix: don't set default trial log limit in CLI (#1856)
a8d4dd2 fix: fix CLI log tailing with elastic (#1853) [DET-4883]
354cdfa fix: another place scheduler config for resource pool not being inherited (#1854)
831235e feat: add resource pool column to tasks list (#1831)
a1821c8 feat: add resource pool column to experiment list (#1819)
ae8011c fix: webui trial logs should not use negative offset (#1845)
d812ab7 chore: connect HGI UI to its API [DET-4638] (#1837)
4fb65a4 fix: scheduler config for resource pool not being inherited (#1847)
6523a8c chore: update cluster utilization overview [DET-4346] (#1788)
9f57c9d docs: fix readme to clarify gpu vs cpu
5f661c8 chore: add custom error for torch's ReduceLROnPlateau (#1849)
2617e74 chore: bump taiko-video version to fix ffmpeg / screenshot save race condition (#1850)
2f2e5ee perf: index as few log fields as possible to increase elasticsearch ingest speed (#1848)
c6b1f0e fix: increase trial log timestamp resolution to support milliseconds [DET-4861] (#1841)
f9d31f9 chore: enable some more Go linters (#1839)
b4b1fe2 chore: retry for more errors when uploading to GCS (#1794)
2d2e96e chore: fix duplicates in elastic log ids (#1834)
79fe5ea chore: add missing apiKey update to internal streaming sdk (#1833)
57a9acc chore: update storybook to resolve github security vulnerability for highlight.js (#1808)
342527c chore: fix trial log following logic (#1832) [DET-4850]
91e2800 chore: Endpoint and infrastructure for hyperparameter importance computation [DET-4464] (#1707)
adc4361 chore: experiment API returns resource pool info [DET-4572] (#1711)
9a6da7a chore: fix ExitedReason log (#1829)
cf3accb fix: dars_penntreebank_pytorch example [DET-4841] (#1822)
513136d fix: show zoom out tip when zoomed into learning curve chart (#1828)
128531e chore: Revert DET-4688, do not support single-trial experiments in trials-sample endpoint [DET-4840] (#1824)
0b24334 fix: update model def button to be a raw link (#1826)
79df210 chore: various elastic fixes (#1825) [DET-4839]
b8d9e20 fix: update types to support new log levels (#1823)
9ae41f5 fix: revert broken user-facing change with experiment config logic (#1821)
b021498 docs: Add Lunch and Learn promotion to README.md (#1815)

Docker images

  • docker pull determinedai/determined-master:0.14.0
  • docker pull determinedai/determined-master:9ee2fa43
  • docker pull determinedai/determined-master:9ee2fa4321ff127bd0a08a90d15fa524d73b597c
  • docker pull determinedai/determined-dev:determined-master-9ee2fa43
  • docker pull determinedai/determined-dev:determined-master-9ee2fa4321ff127bd0a08a90d15fa524d73b597c