Releases: determined-ai/determined
Releases · determined-ai/determined
0.19.10
Release Notes
Changelog
- 600ec8b chore: bump version: 0.19.10-rc1 -> 0.19.10
- 5ff2dfc docs: add release notes for 0.19.10 (#5785)
- f329e04 chore: bump version: 0.19.10-rc0 -> 0.19.10-rc1
- a5f5178 chore: bump version: 0.19.10-dev0 -> 0.19.10-rc0
- 246386c chore: bump version: 0.19.9-dev0 -> 0.19.10-dev0
- dd691a0 chore: lock api state for backward compatibility check
- 485370f chore: mark and hide generated code (#5743)
- 8ed6cd4 fix: Pass default to overwrite Users table column preferences [WEB-760] (#5708)
- b61f85a fix: modify user session token set for external session (#5736)
- 5be6a11 fix: chaging the config from an error state to a new state cleans current errors (#5727)
- 6531d4e docs: python sdk title (#5700)
- d5a55e5 docs: fix make live (#5746)
- 990e62e chore: remove unused code (#5733)
- dc1ecef fix: make LegacyConfig the default choice (#5294)
- 8f7c2c9 fix: useModalCustomizeColumns values not updating on page change [WEB-696] (#5724)
- cea2837 ci: support intg tests with old db data (#5699)
- be67810 chore: use tf 2.8 in ci [MLG-11] (#5711)
- 8dc98ca fix: use new storagePath rules to store experiment-level trial info (#5635)
- 33b459e chore: add telemetry for instance provision requests on provisioner tick (#5656)
- 5b89b89 feat: manipulation of key-value start using storagePath (#5683)
- 73c1a9f fix: 'det job list' ignores the '--resource-pool' argument (#5713)
- 657b5ad chore: Use Pivot from UI Kit (#5695)
- 8216f12 chore: replace antd breadcrumb with UIKit breadcrumb (#5707)
- eb9e530 ci: autolabel docs pull requests (#5714)
- 4226975 chore: Use the UI Kit Pagination component in the application [WEB-776] (#5706)
- 53f1428 fix: check permissions loading in Page component (#5703)
- ed088bc feat: Dashboard (WEB-678) (#5601)
- 4f12118 chore: update browserslist as recommended (#5710)
- 97ca1b2 chore: fix ntsc api annotations (#5702)
- 73ef47e feat: allow configuring multiple RPs for Kubernetes [DET-8711] (#5698)
- 4952af2 chore: replace antd checkbox with UIKit checkbox (#5690)
- 203a2ff chore: fix ui kit Breadcrumb children (#5705)
- 62530a8 docs: update date in conf py file (#5691)
- b69bab1 chore: Use Tooltip from UI kit (#5689)
- 7c93454 chore: Upgrade all eslint devDependencies [WEB-758] (#5686)
- 2f56dea fix: add k8s permissions to fix patch bug (#3395)
- 6f24bd3 feat: k8s restore jobs on master restart [DET-8718] (#5584)
- 435de16 fix: experiment errors with set_stop_requested in dtrain [DET-8788] (#5677)
- 903ce81 feat: User Management modal changes (WEB-727) (#5654)
- 800f564 chore: updated master location hint (#5680)
- 936ecf4 fix: SSO Provider name capitalization (#5675)
- 9817eee fix: historical-usage page (#5681)
- 44ccf32 chore: ui kit [WEB-543] (#5430)
- 817b396 chore: remove mock_workspace_members feature flag [WEB-2] (#5676)
- b660a03 fix: trailing slashes for dirs with selectors (#5678)
- 8f7fca7 chore:
xlarge
resource class forpackage-and-push-system-dev
(#5679) - 8a9aa19 fix: return projects in GetProjectsByUserActivity when no experiments exist [WEB-767] (#5674)
- 27e4b8f chore: Move '+ New Workspace' button to nav item [WEB-698] (#5644)
- 7b48636 docs: Add known issue (#5662)
- f5d6567 fix: put Kubernetes resource pool at right address (#5672)
- 21feb9f chore: clean up det e create cli [MLG-44] (#4914)
- 99ba058 fix: validate workspace name (#5647)
- 97f120c chore: fix PyTorchTrial docstring (#5670)
- 448f3e3 chore: downgrade typescript from v4.9.3 to v4.8.4 (#5667)
- fb7694c refactor: prepare for multiple k8s RPs (#5665)
- 4f9a510 docs: documentation for job_project_source (#5659)
- 57fc20f chore: move k8s resource pool methods to correct file
- c15d6cf chore: remove data layer feature (#5567)
- 27b3bcd fix: save merged metadata for sharded ckpts locally (#5657)
- d611afc fix: revert change to flex menu (#5658)
- 9665d8e fix: allow killing of STOPPING_CANCELED experiments [DET-8283] (#5655)
- e26de79 fix: delete model version modal should dismiss on first try [WEB-744] (#5653)
- c71ce26 docs: fix typo in rbac docs (#5652)
- b51836f fix: changing displayname now reflects on the sidebar (#5603)
- 9b0c1a9 fix: log filters scoped to trial/task (#5626)
- c6ba866 fix: add missing key (#5650)
- 45274db fix: correctly report command submission time on restore (#5631)
- 9055409 fix: swagger json spec 404 error. (#5651)
- c044262 feat: master config option to enable notebook timeouts by default. (#5622)
- 38f5a55 fix: Workspace members search fixes for users and groups [WEB-753] (#5645)
- 4075c74 fix: remove modal in workspace member (#5640)
- 9577b7f fix: hook inside useMemo warning (#5638)
- 9cf35e7 fix: failed test case by adding missing context provider (#5637)
- b05178d fix: input fix for deletion modal (#5636)
- 0311534 chore: remove references to images with TensorFlow 2.5 and 2.6 from documentation and fixtures [MLG-217] (#5623)
- 17fce38 fix: Close continue/fork experiment modal [WEB-699] (#5624)
- 6de230f feat: WebUI use UserBadge component [WEB-721] (#5616)
- 27f3db2 fix: restore columns in exp. multitrial viewer (#5634)
- 48b6312 fix: gpt-neox e2e test (#5630)
- fcb0a3f ci: save time by only storing necessary artifacts (#5629)
- 3c0df83 fix: replace
InlineEditor
with antdInput
in table for description (#5625) - f09854f fix: WorkspaceProjects dropdown (#5628)
- 3297da7 feat: Include No Rank in rankIds dropdown [WEB-239] [WEB-240] (#5449)
- a69403f feat: memoized the vizible columns to avoid re-renders (#5581)
- 6791543 fix: view cluster as a non-admin (#5621)
- 35d9504 chore: WebUI remove old store (#5605)
- e822363 fix: Workspace store quickly reflects add/remove [WEB-635] (#5607)
- 0e5a319 fix: disable autocomplete in user group (#5615)
- c13be0c chore(deps): bump ubuntu from 20.04 to 22.04 in /master (#4325)
- bf1cb1f chore(deps): bump ubuntu from 20.04 to 22.04 in /agent (#4326)
- e1ac54d refactor: rewrite container manager lib without actors (#5386)
- da18942 chore(deps): bump github.com/grpc-ecosystem/go-grpc-middleware from 1.2.2 to 1.3.0 in /master (#4353)
- 7569b08 chore(deps): bump github.com/emirpasic/gods from 1.12.0 to 1.18.1 in /master (#4346)
- fe3ab3a fix: Tensorboard launch from experiment multi-select (#5610)
- 12a91e9 feat: Separate admin page [WEB-720] (#5608)
- 9f28c0c chore(deps): bump github.com/docker/distribution in /agent (#4354)
- f9e2bf6 build(deps): bump github.com/spf13/cobra from 1.2.1 to 1.6.1 in /agent (#5328)
- 97cc3a6 fix: Dashboard recent projects backend (#5606)
- aee93f0 build(deps): bump github.com/labstack/echo/v4 from 4.6.3 to 4.9.1 in /master (#5249)
- 6033108 chore(deps): bump github.com/gorilla/websocket from 1.4.2 to 1.5.0 in /master (#4344)
- 94afa11 fix: make AWS EFS, FSx deployments work. (#5602)
- 6cbefbd build(deps): bump github.com/docker/docker from 20.10.12+incompatible to 20.10.21+incompatible in /master (#5340)
- dc81769 chore(deps): bump github.com/shirou/gopsutil in /agent (#4345)
- 1d97bd3 docs: Add Slurm MaxNodes configuration info (#5604)
- 7a22d22 ci: commit generated go code for protobufs. (#5555)
- cd4a7db fix: hide Active Command UI unless it can be populated from API (#5600)
- 7b5e5bb fix: closing jupyterlab modal updates url (#5599)
- 825af7e fix: Cannot close workspace modal (#5598)
- a434c1f fix: Do not fail gc of already deleted checkpoints [FOUNDENG-387] (#5593)
- 42f71cf fix: note card height adjustment (#5597)
- 19f20a9 fix: docs should not require user auth. [DET-8744] (#5596)
- 9c0d261 fix: ensure current user available in store when reloading app (#5595)
- eaa9245 feat: create endpoint to return projects by activity [WEB-676] (#5582)
- 353e606 docs: Add "Ticket" section to pull request template (#5217)
- a11ff2f chore: add audit logging for echo endpoints (#5447)
- 96d4b42 fix: Enable sorting for user management table (#5594)
0.19.9
Release Notes
Changelog
- 6bd5f07 chore: bump version: 0.19.9-rc3 -> 0.19.9
- 04f9fc7 docs: add release notes for 0.19.9 (#5632)
- b9c2752 chore: bump version: 0.19.9-rc2 -> 0.19.9-rc3
- 2330c97 fix: WorkspaceProjects dropdown (#5628)
- e329ed5 chore: bump version: 0.19.9-rc1 -> 0.19.9-rc2
- 45ef5f9 fix: ensure current user available in store when reloading app (#5595)
- 440880f chore: bump version: 0.19.9-rc0 -> 0.19.9-rc1
- 72167da fix: Tensorboard launch from experiment multi-select (#5610)
- 499665a fix: hide Active Command UI unless it can be populated from API (#5600)
- 2babd44 fix: closing jupyterlab modal updates url (#5599)
- 3978d6f fix: Cannot close workspace modal (#5598)
- 7955b54 chore: bump version: 0.19.9-dev0 -> 0.19.9-rc0
- 60e63d7 chore: lock api state for backward compatibility check
- 28eaeb7 refactor: break out Kubernetes resource pool object [DET-8709] (#5589)
- 909f8ff feat: Add link to fork / continued trial [WEB-296] (#5588)
- 32a34bb feat: sharded checkpoints (#5489)
- 2696b69 chore: split knownroles from store [WEB-589] (#5580)
- 8c055db ci: Revert "ci: temporarily disable test results upload. (#5592)"
- 514eb2c chore: check and format examples (#5587)
- a17a251 chore: eliminate webui warnings [WEB-666] (#5586)
- bac53c9 ci: temporarily disable test results upload. (#5592)
- 96992d4 chore: docs for launcher-provided pools (#5566)
- 8e7f30e ci(circle-ci): install gke auth plugin (#5575)
- 0eb07b3 fix: workspace member (#5579)
- c3d10ae chore: update Kubernetes fetch-creds.sh to handle cert file path (#5524)
- bfc5067 docs: Minor Enroot doc fixes (#5585)
- 9d38289 feat: Display experiment total checkpoint size [WEB-298] (#5554)
- c7d1e62 chore: split omnibar state out of store (#5553)
- 33c9d2c docs: Increase Slurm dependency to 20.02 [FOUNDENG-377] (#5583)
- 4efa6a2 chore: Remove storybook (#5576)
- 66de8c4 fix: Report incorrect configuration message to stderr [FOUNDENG-370] (#5578)
- d60d1a0 chore: Pinned workspaces in stores/workspaces [WEB-586] (#5522)
- e05d17d chore: add structured task and trial logs (#5569)
- a038c8f ci: comment out the Ruby-based markdownlint checks (#5577)
- ead8c10 feat: fix spinner after logout (#5572)
- e377d2f chore: inherit PBS & Slurm defaults (#5560)
- 0f18bb8 chore: Remove check for deprecated init permissions (#5571)
- e9365d4 feat: moved users and auth away from the Store (#5539)
- 485971b feat: copy changes (#5561)
- d1e2e8e ci: fix TestRWCoordinatorLayer flake (#5563)
- 7c58a72 chore: Split UserRoles out of Store (#5549)
- 9899398 refactor: rewrite fluent lib without actors [DET-8303] (#5385)
- 1854b05 fix: ContainerLog.ToEvent honors Level [FOUNDENG-370] (#5558)
- 05c2754 chore: fix mypy false positive (#5559)
- f7a90e2 chore: remove unused file (#5557)
- c32ac36 chore: Split determined info from Store [WEB-584] (#5541)
- b18d7dd fix: jupyter modal stalling and garbling input [WEB-579] (#5556)
- bba3096 chore: remove unused setAgents store action (#5551)
- 59950b8 chore: fix potential named statement leak (#5320)
- 131bfb2 feat: add activity table and post user activity api [WEB-665] (#5518)
- b35d470 fix: group management edit and delete options (#5550)
- 27e86cc docs: NVIDIA Enroot support [FOUNDENG-329] (#5546)
- 1856fd0 chore: update gpt-neox example and improve deepspeed launcher (#5527)
- cd0aac8 chore: filter workspaces by userId (#5529)
- 6bb1b35 fix: use bash in bash script (#5540)
- 3736b2c chore: split
resource pool
from Store Context (#5543) - 023cd8b chore: Update resource pool resolution & validation (#5538)
- a2cc111 chore: Moving active experiment and task queries to separate stores [WEB-581] [WEB-582] (#5521)
- 1ae77fd feat: Oauth2 in Python SDK [DET-8504] (#5422)
- 7373713 chore: schemas implement generic-like behavior (#3763)
- 336ea28 chore: delete try_reauth from authentication logic (#5542)
- cd060ad ci: add markdownlint and json schema (#5545)
- cec6348 chore: transformer image installs scikit-learn (#5547)
- b81da3b chore: add Session.with_retry() (#5535)
- 7695e35 refactor: rewrite container lib without actors [DET-8301] (#5384)
- 9265ba1 fix: Enable tasks filter by user [WEB-675] (#5525)
- af269e5 chore: bump version: 0.19.8-dev0 -> 0.19.9-dev0
- 7f741f4 docs: add release notes for 0.19.8 (#5537)
- e07330e ci: move useful weekly tests to master branch, remove the rest [DET-8725] (#5530)
- 4161fc6 fix: workspace list pagination (#5534)
- c1a237f fix: sandbox experiment list settings for each project (#5531)
- ca31dcc chore: bump antd (minor version update) (#5475)
- 6a392b9 feat: open tasks in existing tab [WEB-420] (#5528)
- f60e8f4 fix: avoid error converting user requested stop for custom searcher (#5520)
- 6028164 feat: add icons in experiment and trial detail pages (#5512)
- aeb35f6 feat: WebUI support user agent group when creating user [WEB-638] (#5446)
- 67010f7 test: speed up nightly
test_protein_pytorch_geometric
. (#5513) - 58de4d4 fix: Workspaces list, prevent settings update loop (#5519)
- 1622710 feat: migrate projects to use store/context (#5509)
0.19.8
Release Notes
Changelog
- 5eb5b45 chore: bump version: 0.19.8-rc2 -> 0.19.8
- 95c6ef4 docs: add release notes for 0.19.8 (#5537)
- 5c4c600 chore: bump version: 0.19.8-rc1 -> 0.19.8-rc2
- e8ea6b6 fix: sandbox experiment list settings for each project (#5531)
- 0e8d941 chore: bump version: 0.19.8-rc0 -> 0.19.8-rc1
- bcfca6a fix: Workspaces list, prevent settings update loop (#5519)
- b8cb0c1 feat: migrate projects to use store/context (#5509)
- b2e7c4e chore: bump version: 0.19.8-dev0 -> 0.19.8-rc0
- 682745a chore: lock api state for backward compatibility check
- 6b88d71 chore: add pre-commit check for python (#5392)
- 1c06774 feat: det deploy aws enterprise edition (#5516)
- 17a65b3 feat: retry transient failures on det experiment wait (#5393)
- b0a3f29 fix: nightly test for custom searcher (#5515)
- 2c3a54e fix: Create workspace as non-admin [WEB-691] (#5517)
- fd98a7d docs: Document PBS CUDA_VISIBLE_DEVICES requirement [FOUNDENG-359] (#5514)
- 63e502a docs: Document PBS CUDA_VISIBLE_DEVICES requirement [FOUNDENG-359] (#5511)
- 41dc8c1 chore: add tensorboardTimeout to Helm chart [DET-8716] (#5500)
- 5331719 chore: changes k8s unable to find exit status from 137 -> 1025 [DET-8717] (#5507)
- c2d050c chore: minor fixups (#5510)
- e930e2e chore: bump
Prettier
v2.8.0 (#5505) - 071b6ef fix: rbac describe role fails with --json flag (#5506)
- 2e492cb test: fix model registry test. (#5504)
- 1a62e3d ci: fix test_workspace_org (#5503)
- 5febd98 fix: cli and sdk logout works without login (#5493)
- 90f18a2 fix: Increase experiment icon font weight (#5488)
- 38ead98 chore: remove unnecessary cleanup (#5498)
- 1e3041c feat: Add warning for submissions for requests that cannot currently be fulfilled [DET-6410] (#5376)
- 09f3781 fix: aws instance id under IMDSv2. (#5494)
- dd761a6 docs: Slurm/PBS override of default resource pools (#5496)
- 5b3d8ae feat: Add slurm.gpu_type to expconf [FOUNDENG-338] (#5492)
- 57dc77c chore: container defaults for PBS/slurm (#5485)
- 3da4a45 chore: configure HPC resource pool providers (#5479)
- e5d5477 fix:
GetModelVersion
lookup by version number. (#5408) - 1d8cda2 ci: disable Tensorflow 2.5-2.7 tests. (#5487)
- 65562d6 ci: bump openjdk version. (#5490)
- 95f704a chore: Move workspaces to their own store [WEB-674] (#5484)
- 1c883d9 refactor: restructure rm package (#5457)
- 8fc3ce3 fix: Ensure authorized_keys permisison [FOUNDENG-342] (#5454)
- 9dd5829 ci: fix rounding that created invalid timestamppb.Timestamps (#5482)
- a0652de fix: solve flakiness for the Spinner tests (#5483)
- 7e02d2b fix: Experiment listing should not have spurious state triggers (#5477)
- ecdb073 fix: metrics selection (#5480)
- 17c444d chore: tighten py bindings to_json advertised type (#5450)
- f2fe593 ci: bump python 3.7.11 -> 3.7.15. (#5476)
- 8957c02 feat: useSettings improvements (#5187)
- d85ee6c chore: more pbsbatch_args usage (#5458)
- 8e317cb fix: trial comparison modal style (#5467)
- 8d349cc fix: handle 'null'::jsonb when aggegrating resource size in proto_get_trials_plus.sql (#5464)
- c05a5e2 perf: remediate issues with
cast
+proto_checkpoints_view
(#5465) - f55aa86 chore: replace deprecated scss (#5460)
- 2384e56 chore: remove
skipLibCheck
(#5459) - 13fa0c1 refactor: rewrite docker lib without actors [DET-8300] (#4943)
- 7832a93 style: remove wall of echos from shell script (#5456)
- fb4240b refactor: rewrite websocket lib without actors [DET-8299] (#5382)
- 118777a fix: WebUI project loading msg (#5439)
- 662a023 fix: note page button issues (#5453)
- ec01546 feat: WebUI use icon for experiment/trial state [WEB-237] (#5373)
- 8103175 string interpolation log (#5455)
- e0b9572 fix: Jupyter notebook iframes (#5434)
- 1a79b1a fix: Revert changes to rankId and other numeric filters (#5452)
- 931f79c feat: enable limited core api usage in NTSC (#5451)
- b279bb5 feat: user mgmt functions in SDK + new user API from old API + CLI uses SDK [DET-8495, DET-8496] (#5206)
- 08f40af chore: bump
Typescript
(#5444) - 781e754 chore: Add expconf slurm.gpu_type [FOUNDENG-338] (#5448)
- c7d6923 chore: FOUNDENG-296 Determined shows PULLING state while container is spinning up (#5443)
- 516f2af fix: editing projects in table [WEB-641] (#5445)
- 75e8214 docs: Document Singularity cache managment script [FOUNDENG-333] (#5437)
- 3c82434 feat: remove checkpoints from multi trial tabs [WEB-639] (#5440)
- 4710747 fix: remove
noImplicitAny
(#5416) - c4fe410 fix: experiment with null notes can't load (#5429)
- aa94e65 revert: Add webhooks_base_url to det deploy (#5432)
- dc93b51 fix:
timeAgo
tooltip position (#5419) - 2aba2a4 fix: sync modal data when cancel (#5433)
- df231ec ci: bump resource class to avoid test-intg-master flakes (#5438)
- cb7e082 feat: Trial and Task log filters [WEB-239] [WEB-240] (#5420)
- 4356fae Revert "chore: Revert content-security-policy change" (#5424)
- c19e23f chore: make patch user atomic [DET-8659] (#5428)
- bf02b05 feat: Add webhooks_base_url to det deploy (#5354)
- 5a476ac feat: master and agent instances created with IMDSv2 support [DET-7987] (#5421)
- 43e176e chore: bump version: 0.19.7-dev0 -> 0.19.8-dev0
- 1b3cb55 docs: add release notes for 0.19.7 (#5425)
- ff7ad9e chore: Revert content-security-policy change
- 63dc63a chore: Add content-security-policy via meta tag, webpack plugin [WEB-310] (#5414)
- 5a3acc1 fix: fix copy button (#5417)
- 75969d3 fix: better name for hermesfilters; listen for onReset for filters (#5405)
- bf0d2a8 fix: WebUI remove checkpoint storage config (#5409)
- f29cfd0 fix: avoid panic in getTasks (#5412)
- a453ed5 fix: clear table settings between users and groups pages (#5415)
- e1270e5 fix: adds user to group when create group through external token (#5413)
- cbd92d3 fix: workspace header in mobile view (#5406)
- 6fa66ca chore: migration for adding newly created scim users to usergroups (#5410)
- fb10b6f build: lint and fmt shell scripts [DET-7566] (#5389)
- 384a209 fix: create personal groups for other ways a user can be created (#5407)
- 9933a46 fix: Avoid writing to users known_hosts [FOUNDENG-314] (#5403)
- 8ba901e chore: remove unused slurm options validation (#5401)
- 87982d0 fix: handle webhook testing errors (#5404)
- 3e54926 fix: Skip role fetch when rbac not enabled (#5402)
- e6d5f6d fix: sort by log severity level (#5390)
- 5c7d5c7 fix: update webhook payload check in e2e test (#5377)
- ccb1e95 ci(test-e2e): add e2e test mark as cluster label (#5368) [INFENG-5]
0.19.7
Release Notes
Changelog
- b6c8b94 chore: bump version: 0.19.7-rc2 -> 0.19.7
- 806c033 docs: add release notes for 0.19.7 (#5425)
- 95c3c48 chore: bump version: 0.19.7-rc1 -> 0.19.7-rc2
- a522791 fix: fix copy button (#5417)
- 6fe0037 fix: better name for hermesfilters; listen for onReset for filters (#5405)
- 8c5a63e fix: WebUI remove checkpoint storage config (#5409)
- 3de9bdb fix: Skip role fetch when rbac not enabled (#5402)
- 64ccb0a fix: avoid panic in getTasks (#5412)
- 51f2e98 fix: adds user to group when create group through external token (#5413)
- 3d88e34 chore: bump version: 0.19.7-rc0 -> 0.19.7-rc1
- 1d8f461 chore: migration for adding newly created scim users to usergroups (#5410)
- 7078786 fix: create personal groups for other ways a user can be created (#5407)
- ef75386 fix: workspace header in mobile view (#5406)
- 817abe3 fix: update webhook payload check in e2e test (#5377)
- 3bdad41 chore: bump version: 0.19.7-dev0 -> 0.19.7-rc0
- b2746c0 chore: lock api state for backward compatibility check
- e1b26a5 feat: WebUI expand workspace configuration [WEB-33] (#5396)
- fb5ea8b chore: Rbac oss audit logging [DET-8309] (#5378)
- 76fb4f7 fix: redirect url to login page (#5399)
- 0075d01 chore: pause shared-web ci checks (#5388)
- b149ee7 ci: increase RAM for master build (#5397)
- 8226925 chore: Move Agents into their own context, introduce Loadable abstraction (#5358)
- 20f548d fix: Workspaces member page refetch after update [WEB-621] (#5391)
- 5db6ef4 chore: cli allow workspace --checkpoint-storage-config-file to use yaml (#5394)
- cd6c3cd fix: typescript inside keys function (#5383)
- 2c08b99 fix: group together 4xx and 5xx HTTP responses for prometheus [DET-8336] (#4996)
- b81c4b4 ci(remote-docker): pin version to 20.10.18 (#5369)
- 85536a7 ci: add pre-commit execution action (#5379)
- 264f489 docs: RBAC v1. (#5298)
- 892ef4a ci: enable yamllint [INFENG-116] (#5359)
- 282ddc0 fix: TorchWriter.reset() closes SummaryWriter (#5375)
- f9d8cfe fix: enter to submit (#5374)
- 6472abe chore: support out-of-k8s dev workflow (#5237)
- bbebe34 chore: generate PATCH-friendly bindings (#5325)
- 3221e4e fix: allow workspace admins to change role assignments in webui. (#5371)
- cb7156a feat: checkpoint storage config per workspace [DET-8350] (#5309)
- 83b201d ci: restrict token perms & change head ref
- 288c69c fix: improve previous tag logic (#5367)
- 44000a6 fix: validate username and display name in
PostUser
. (#5366) - 1475136 ci: remove -f from remote
- 62c6e3e ci: add throwaway user information for rebase
- 23b34cb ci: use head_ref for rebase if available; else ref
- 752c7fa ci: switch rebase check to use https
- 3a26aef ci: Move conditional down to steps
- 7231c1e ci: Switch checkout method [INFENG-98] (#5364)
- 2ebf336 chore: Enable library checks for TypeScript; update types [WEB-191] (#5360)
- 68b6205 docs: remove references to PBT (#5257)
- c8cfcaa ci: add more credentials [INFENG-98] (#5363)
- 65f6df3 fix: Shiyuan's suggestion to release note (#5362)
- f16095d ci: fix action env vars
- 342e5d8 ci: Add rebase check on PR/main push [INFENG-98] (#5190)
- 774275c Revert "fix: shiyuan's change to custom searcher release note"
- f3c6dbe Revert "lint"
- b902fd4 lint
- b2b3be5 fix: shiyuan's change to custom searcher release note
- 540d486 ci: Fix previous tag logic (#5103)
- c82d0af fix: Group management page role edit [WEB-544] (#5339)
- 5729247 chore: rbac refactor authz iface (#5343)
- 149f9d9 Add release notes header with link [INFENG-115] (#5327)
- 1ad39c3 chore: bump version: 0.19.6-dev0 -> 0.19.7-dev0
- fab75cc docs: add release notes for 0.19.6 (#5357)
- f2c051b feat: remove webhooks feature flag [WEB-560] (#5345)
- 28c10ed docs: fix tutorial link (#5355)
- 5adb2e2 refactor: rbac protos: move from is_global to scope type masks. (#5346)
- e4a9e02 fix: signed payload generation (#5353)
- 3c8b2f1 chore: Replace react notebook library with notebookjs [WEB-77] (#5284)
- 6e19529 feat: add stable diffusion textual inversion example (#5280)
- de0272d chore: replace
enum
withobject as const
(#5348) - 761cf72 fix: overflow of long names without spaces [WEB-485] (#5351)
- c9ddf77 fix: fix roll polling in non rbac instances (#5349)
- 781716f chore: add field for internal web UI use to identify product context (#5342)
- 0d4281e chore: introduce useui as a separate store [DET-8575] (#5338)
- f3319c0 chore: add style lint fix (#5337)
- f0dbef8 docs: Explicitly document Apptainer as supported [FOUNDENG-294] (#5344)
- 2747638 feat: Create Webhook Sender [WEB-213] (#5258)
- 623976d feat: when user can't create workspaces, show a disabled button [WEB-553] (#5341)
- 2f5a5ab fix: duplicated messages logged at INFO and ERROR levels. (#5323)
- 0a651a3 fix: add and edit workspace member UI [WEB-552] (#5335)
- f41a90d chore: don't tell users to contact admin for wrong password or wrong username (#5336)
- 3ce6cd7 chore: HPC job ID is logged in experiment log. (#5315)
- aaba906 feat: Non-global permissions assignable globally [WEB-539] (#5324)
- 9f76ecb feat: User can change their own username [WEB-238] (#5304)
- 6a663c5 fix: improve chart interactivity (#5326)
- 5f8c3e2 chore: simplify
ptrs.Ptr
(#5330) - 0370f54 fix: login loading (#5334)
- 6ec84d1 chore: log error when experiment is unrestorable (#5333)
- 02c5ab2 fix: HP Parallel Coordinates remembers filters as data comes in [WEB-279] (#5243)
- 99827d7 fix: exclude allocations without start times from aggregation (#5329)
- 9cf034b fix: account for incompatible pbs-related expconf change (#5332)
- b646464 feat: support streaming logs in Python SDK [MLG-46] (#5174)
- b70f8a4 chore: delete dead code to fix flake8 (#5331)
- 0780178 fix: Poll roles, remove canGetPermissions permission (#5322)
- 1f1fd81 chore: change list-users-roles and list-groups-roles to return assignment info (#5286)
- 45d30bc feat: implement feature switch [WEB-535] (#5310)
- 23d951a docs: Notification doc zapier [WEB-216] (#5305)
- 039fd0d docs: slots_per_nodes for PBS & Slurm (#5314)
- 9edc784 docs: Add PodMan requirements and known issues [FOUNDENG-289] (#5311)
- ea333d6 fix: move permission denied error to not encounter hash error (#5319)
- 6c814be chore: Revert "fix: remove duplicate event message." (#5316)
- e4b1494 chore: add FieldMask type to apiutils (#5247)
- 677dbc9 test: fix flaky custom searcher test (#5317)
- 2a76a70 feat: actually support --device strings (#5287)
- 722dfdb fix: dont trigger loading state on experiment selection (#5299)
- 58d2a27 chore: replace
enum
withobject as const
(#5308) - 1ef5210 fix: layout of logview (#5307)
- c711c43 ci: fix missing say command (#5313)
- 13760a3 fix: tweak live docs server. (#5293)
- b3ff34c fix: remove duplicate event message. (#5252)
- 2d4eff4 feat: Changing and removing roles from workspace members list (#5283)
- 4fae14f fix: remove compare action for experiments (#5302)
- 7f33e15 fix: dont show loading state when polling trial details (#5301)
- 5ae91f1 chore: API and DB error for invalid input (#5212)
- bdf5b3e fix: UI improvement when no permission [WEB-532] (#5274)
- 2dd8e71 fix:
det rbac describe-role
list global assignments for users. (#5297) - 10a866f fix: dont show scrollbars in table cells (#5290)
- 02511f2 chore: take out manual shared testing instructions (#5270)
- f40c992 chore: Revert "chore: update to
Stylelint
v14 (#5238)" (#5296) - e0b4b54 fix: Avatar text color change (DET-8237) (#5246)
- 515d25b fix: account for incompatible expconf change (#5292)
- ae4282e docs: add documentation for setting up Slack Webhooks [WEB-215] (#5278)
- 1a2acad fix: fetchMyRoles on login, only if rbac is on (#5291)
0.19.6
Release Notes
Changelog
- 12adc47 chore: bump version: 0.19.6-rc6 -> 0.19.6
- 2766fd1 docs: add release notes for 0.19.6 (#5357)
- 4cb7414 chore: bump version: 0.19.6-rc5 -> 0.19.6-rc6
- ad4aca7 chore: bump version: 0.19.6-rc4 -> 0.19.6-rc5
- 54be3a6 chore: bump version: 0.19.6-rc3 -> 0.19.6-rc4
- 68013ba chore: bump version: 0.19.6-rc2 -> 0.19.6-rc3
- 1d124b1 fix: account for incompatible pbs-related expconf change (#5332)
- 5ecd136 chore: bump version: 0.19.6-rc1 -> 0.19.6-rc2
- a19bbf0 fix: dont trigger loading state on experiment selection (#5299)
- 05ad8b6 fix: layout of logview (#5307)
- 846e3e3 fix: remove compare action for experiments (#5302)
- 96cde39 fix: dont show loading state when polling trial details (#5301)
- 89a3b60 chore: bump version: 0.19.6-rc0 -> 0.19.6-rc1
- b42a431 chore: Revert "chore: update to
Stylelint
v14 (#5238)" (#5296) - 88f75e3 fix: dont show scrollbars in table cells (#5290)
- 073d79a fix: fetchMyRoles on login, only if rbac is on (#5291)
- 40dbeac fix: account for incompatible expconf change (#5292)
- 4f849d6 chore: bump version: 0.19.6-dev0 -> 0.19.6-rc0
- ba7ae15 feat: Trials Comparison Frontend (#4820)
- 7c20754 chore: update to
Stylelint
v14 (#5238) - d140af0 fix: keyboard scrolling in logs [WEB-463] (#5266)
- fc05dfd fix: lockdown NTSC EE rebase import cycle (#5285)
- 60e5fe1 feat: custom searcher (#4424)
- 4b8c94f docs: Add PBS documentation [FOUNDENG-184] (#5232)
- 013b3d9 docs: external notification general doc [DET-219] (#5244)
- 020d23a fix: remove =true from sso url querystring (#5282)
- 6c54d38 chore: improve project not found error message in create experiment (#5277)
- 36e4b35 fix: position resize shadow under cursor at start (#5242)
- dd3e39d feat: support configuring agent self-shutdown for
det deploy aws
(#5241) - 644e747 fix: Wire up user edit modal to rbac endpoints (#5261)
- 340c8a1 chore: remove unnecessary getPath usage (#5272)
- 5db81f6 refactor: Add ability to lockdown NTSC [DET-8276, DET-7377] (#5260)
- a487eab refactor: authz checkpoints [DET-8533] (#5233)
- 3ea55a7 fix: add version check for protoc to prevent confusion (#5271)
- 7ac9e7e fix: workspace page tab route (#5269)
- c4b7b50 chore: add to git blame ignore list (#5262)
- 7b264a3 fix: Workspace member modal handling of groups and roles (#5268)
- aa9f1ba chore: make web_lint_check.py executable (#5265)
- 781bff6 fix: fix getPath (#5256)
- 21355bf fix: check target user, allow \wadmin\w names but not admin itself [WEB-529] (#5251)
- 2fee696 feat: page listing Webhooks [from main repo] [WEB-211] (#5259)
- ad8e6ef fix: selector fixes in log viewer (#5192)
- b3f253f docs: fix fypos, grammar edits for clarification, etc. (#5186)
- 67890df fix: Load workspace members from RBAC API (#5253)
- 81ac804 chore: PBS & Slurm options for commands (#5219)
- cef9430 fix: decorate PBS & slurm options with ,omitempty (#5230)
- 8cf354c feat: support -i/--include [MLG-194] (#5193)
- 77ff49a chore: bump version: 0.19.5-dev0 -> 0.19.6-dev0
- 5ce358c docs: add release notes for 0.19.5 (#5227)
- 794742b build(deps): bump amannn/action-semantic-pull-request from 4 to 5 (#5239)
- 79a1630 feat: ls for all applicable cli options [MLG-193]
- c5b535b fix: trial workloads sort and filter (#5228)
- 7096deb fix: Roles should be fetched immediately when logging in (#5229)
- b32039d chore: update to
React18
(#5226) - f88fddc fix: register checkpoint "new model" workflow [WEB-499] (#5215)
- 8ee7e8c perf: remove unused getExpValHistory calls (#5172)
- 7e7cd96 refactor: Authz tasks [DET-8367] (#5209)
- dc61608 fix: hide uncategorized from nav, when it is not available to RBAC user (#5203)
- 10f5d7b feat: Add CAN_EDIT_WEBHOOKS permission to pre-canned admin role [WEB-218] (#5200)
- 552bef9 chore: update to
react-router-dom
v6 (#5222) - fccaeca test: enables linting for master integration tests (#5223)
- b35a559 fix: correctly aggregate allocation resources by agent label. (#5214)
- b9803c0 chore: Round Robin scheduler message (#5216)
- cfdd7bb fix: custom error requires permission object (#5221)
- bdb41a8 build: fix check if test-intg-downstream could be skipped (#5115)
- 999eb80 feat: add echo authentication by default (#5008)
- 906356e chore: new react-router components (#5166)
- d285e8f chore: add custom permission [DET-8526] (#5195)
- b5d0185 fix: replace shell script with python script for
Pre-Commit
(#5220) - 98ef84e fix: fix the error message for auto checkpoint download (#5201)
- 4806f1f ci: only run lint in package-and-push for release workflows. (#5191)
- c091de9 chore: revert "fix: use theme var in Avatar stylesheet [DET-8237] (#5109)" (#5211)
- b8155bd fix: table ui follow up (#5207)
- 8f01284 fix: only call fetchKnownRoles in RBAC, closing testing issue (#5210)
- 78b2b9d refactor: authz allocations [DET-8366. DET-7971] (#5178)
- 8a7e799 chore: usePolling on fetchMyRoles (#5204)
- 48ac9af fix: RBAC calls getPermissionsSummary, changes to admin's use of listRoles [WEB-517] (#5185)
- cf2df49 chore: remove accidentally added attributions file (#5202)
- 20fde08 fix: resolve table UI issues (#5199)
- 08bab93 fix: remove unnecessary loading state (#5181)
- 37fe428 fix: page transition in multi trial page (#5198)
- 106e231 feat: rbac authz refactor for user groups [DET-8478] (#5136)
- 522d192 fix: avoid using lookbehind regex (#5197)
- 40b3739 fix: stop infinite rerender in move experiment modal (#5194)
- 0d608e5 fix: check for horovod backend in PyTorchTrial (#5180)
- 0429c64 ci: bump cache buster version (#5189)
- 3cecf50 feat: Add Webhook CRUD API [WEB-212] (#5175)
- 8f01b46 feat: enable configuring the agent to shut down on connection failure (#5044)
- cad902c feat: support
security.authz
config option in helm chart. (#5183) - 5d5be28 chore: refactor slurm/pbs options to expconf (#5150)
- 1703adf fix: do not publish helm chart for *-rc releases [DPS-260] (#5182)
0.19.5
Release Notes
Changelog
- 1d0fe59 chore: bump version: 0.19.5-rc2 -> 0.19.5
- c8b90b2 docs: add release notes for 0.19.5 (#5227)
- 56372bb fix: fix the error message for auto checkpoint download (#5201)
- 0e9f57b chore: bump version: 0.19.5-rc1 -> 0.19.5-rc2
- c3248ba chore: revert "fix: use theme var in Avatar stylesheet [DET-8237] (#5109)" (#5211)
- 23c2acb fix: table ui follow up (#5207)
- da655cb chore: bump version: 0.19.5-rc0 -> 0.19.5-rc1
- 1ce5465 fix: resolve table UI issues (#5199)
- 4535049 fix: page transition in multi trial page (#5198)
- 62b9fab fix: avoid using lookbehind regex (#5197)
- e2a1ba8 feat: enable configuring the agent to shut down on connection failure (#5044)
- 4a2ea91 fix: do not publish helm chart for *-rc releases [DPS-260] (#5182)
- 52cceb9 chore: bump version: 0.19.5-dev0 -> 0.19.5-rc0
- 2183501 chore: lock api state for backward compatibility check
- e59380a fix: streaming errors not being handled (#5171)
- ee1195e feat: agent user group settings per workspace [DET-8472, DET-7547] (#5122)
- 21e29b8 fix: use theme var in Avatar stylesheet [DET-8237] (#5109)
- 07309d5 chore: streamline local frontend against remote cluster (#4593)
- 12b42ef fix: Sort workloads by training and validation metrics [WEB-430] (#5167)
- ec90ab1 feat: short options for RBAC CLI. (#5169)
- 08e270a test: fix
test-k8-mount
. (#5153) - 2dcbab9 fix: Table reordering fixed (#5159)
- a70db1f chore: update readme for web (#5168)
- 11ed4c3 fix: dont show unknown error popups for telemetry
- a2a66b7 chore: lint-staged for web (#5155)
- f920bde fix: more correctly compute TLS cert hash in CLI (#5164)
- 3b22f70 chore: Convert permission.name string to permission.id enum [DET-8464] (#5121)
- 316413a fix: new tab to open Jupyter (#5145)
- 79f0296 fix: remove workspace members view mock special casing (#5138)
- 6ac2631 fix: FOUNDENG-246 Task reports "RuntimeError: Dataset not found or corrupted" (#5160)
- e54b33a chore: replace router hooks for react-router-dom v6 (#5142)
- 9a31b85 fix: regression in master user auth error message. (#5152)
- d7b9bc5 feat: groups and users to workspace proto (#5085)
- 704ea4f feat: cli: support downloading checkpoints through master (#5083)
- 80e880b feat: support slots_per_trial=0 in Trial classes (#5035)
- 8bd9360 build: setup
Pre-Commit
(#5116) - ef26d7f fix: handle transient GCS errors in Tensorboard upload [DET-8491] (#5151)
- cf54175 fix: shells fail if vars contain a newline [FOUNDENG-251] (#5148)
- 8df93c5 fix: adjust elevation for dark mode (#5147)
- 058713a chore: Placeholder when notes/markdown is disabled [DET-8409] (#5131)
- f5b7c99 feat: Table component performance enhancement (#5056)
- 4d0addd feat: New Project button/message checks permission [DET-8374] (#5114)
- 1d0963c feat: checkpoint download through master (#4989)
- 2a16f81 perf: fix memory leak warnings (#5139)
- 61103df fix: remove temporary text (#5141)
- c82be87 ci: cache more Go files (#5140)
- 6d8a495 fix: doc typos (#5127)
- a72305f chore: only request listRoles with rbac on [DET-8419] (#5132)
- c596cd9 docs: bring back analytics. (#5134)
- 027ef16 feat: Trials Comparison backend (#4543)
- 41deb4d chore: replace use of XORTrials with OneVarTrials in TestPyTorchTrial [MLG-42] (#5107)
- 91bd0e3 fix: null check (#5130)
- 5320aec docs: fix user-reported errata. (#5133)
- e896981 feat: RBAC for experiment actions [DET-8372] (#5069)
- 47dd822 fix: tensorboard inherits imagePullSecrets from experiment [DET-8458] (#5123)
- 5fc93fd feat: Add "Members" view to workspace page [DET-8219] (#5113)
- bd57ec4 ci: Switch to token w/ higher quota [INFENG-100] (#5125)
- 8f2e29e chore: oss refactor for auto assign WorkspaceCreator role plugin (#5075)
- 26bfdcf chore: move polling hook to shared (#5078)
- fc778f3 feat: rbac CLI [DET-7868] (#5061)
- de4b7bc chore: reuse created loggers (#5077)
- 010382b fix: prevent reattach deadlock upon container reattach (#5112)
- e1f9502 chore: bump version: 0.19.4-dev0 -> 0.19.5-dev0
- 38813e8 docs: add release notes for 0.19.4 (#5110)
- af8be29 build: skip more jobs for web-only prs (#5117)
- 672eda9 fix: fix unshared dependency [DET-8462] (#5118)
- 08c1472 chore: fix package-lock format compatibility (#5073)
- 8d9bbf5 fix: remove
as any
(#5089) - f6faa57 fix: No Permissions page vs. 404 page (#5088)
- db3c53b feat: Reconnect to Slurm jobs on startup (FOUNDENG-215) (#5104)
- a9c2a88 chore: include running eslint fix in fmt target (#5096)
- dd59244 chore: remove check from pytorch [MLG-182] (#5071)
- 02a8f45 fix: fix useSettings bug (#5102)
- d74d5ac fix: test_efficientdet_coco_pytorch_const failing (#5097)
- f7098e2 fix: Initial comm failure should provide explaination (FDN-217) (#5092)
- ea58514 fix: new states can be paused, canceled, killed [DET-8449] (#5090)
- 9a0bcb7 fix: display primitive hps correctly in parallel coordinates plot (#5091)
- 399435c fix: allow sort of metrics under avg_metrics [DET-8408] (#5086)
- 8100711 ci: store docs output as a single file (#5081)
- 80df08b refactor: improve storybook [DET-8099] (#5011)
- 1734a84 fix: upload helm chart to GitHub release artifacts [INFENG-93] (#5064)
0.19.4
Changelog
- 1b50f1c chore: bump version: 0.19.4-rc3 -> 0.19.4
- 424cbaf docs: add release notes for 0.19.4 (#5110)
- 11b8688 chore: bump version: 0.19.4-rc2 -> 0.19.4-rc3
- 3ce78d3 fix: consolidate useSettings changes (#5106)
- 555b583 chore: bump version: 0.19.4-rc1 -> 0.19.4-rc2
- 25929ee fix: fix useSettings bug (#5102)
- 2197af1 chore: bump version: 0.19.4-rc0 -> 0.19.4-rc1
- ac8a69a fix: new states can be paused, canceled, killed [DET-8449] (#5090)
- 375b7c9 fix: display primitive hps correctly in parallel coordinates plot (#5091)
- 7dffc93 fix: upload helm chart to GitHub release artifacts [INFENG-93] (#5064)
- 56daac2 chore: bump version: 0.19.4-dev0 -> 0.19.4-rc0
- 89bdccf chore: lock api state for backward compatibility check
- f8dc2c8 fix: bumpenv again after moving pip install protobuf (#5082)
- ad92bd8 chore: Avoid error from RBAC listRole endpoint [DET-8395] (#5079)
- 64640fd chore: format code (#5080)
- ae0207d revert: work around a bum pyzmq build (#5074)
- 85ce9a1 feat: WebUI edit permission at user profile [DET-8224] (#5068)
- 97408f3 feat: Add state wait to tensor board [DET-8273] (#5009)
- 2bca637 fix: support setting agent username and group in user APIs. (#5055)
- 9fc9865 ci(weekly-vuln-scan): remove superseded workflow [INFENG-94] (#5051)
- c29e3ce fix: Trial.TotalCheckpointSize incorrect [DET-8399] (#5063)
- cef4492 docs: Slurm installation feedback from user installs (#5048)
- bfbe141 build: speed up parallel runs for react make (#5065)
- f075bff chore: add lint for Prettier and Ignore format code commit (#5059)
- ca9cc81 ci(scan-docker-images): fix sarif upload job [INFENG-94] (#5050)
- de9b47c fix: handle pin when its not defined (#5062)
- 8b8c2c8 feat: permissions to create and view workspaces (#5045)
- 85396eb chore: format code (#5058)
- 2890b33 feat: add cluster_admin permission route (#5024)
- 2b3cad5 chore: test PyTorch AMP with gradient aggregation [DET-6105] (#4987)
- d8b6c47 feat: WebUI add RBAC feature switch [DET-8352] (#5036)
- f2fe163 fix: avoid pip executable when upgrading pip in circleci (#5057)
- 7248ef8 fix: set auth cookie client side for external flows [DET-8310] (#4967)
- 999d70b fix: WebUI codeview test mock [DET-8351] (#5047)
- 079e8af feat: Modal to add role to group [DET-8220] (#5022)
- f14e5be chore: bumpenvs for ROCm changes (#5026)
- 62756f5 feat: setup
Prettier
(#5033) - d39b55d chore: add
.prettierrc.js
(#5034) - 161285b chore: personal groups get automatically created for users [DET-8363] (#5025)
- d4ed2e6 ci: work around a bum pyzmq build (#5029)
- 71805d1 chore: replacing globalOnly with isGlobal in web code (#5028)
- cbac359 fix: add alignment in tables (#4995)
- 8fe208f fix: update shell and star icons (#5019)
- 2e7458b fix: remove
det deploy aws
vpc deployment type, fix govcloud agent AMI. (#5023) - 2da9d14 chore: permissions and permission summary proto (#5020)
- 8abd939 feat: add jupyter notebook files support (#5004)
- 86ffb08 fix: duplicate checkpoints returned by listing checkpoint routes (#4894)
- 273eb84 feat: add the useSettings to the view code (#4961)
- 8636ad4 chore: update slurm-known-issues (#4892)
- f7322ad feat: add stub RBAC API (#4990)
- bbf5967 chore: split out ui store and its actions [DET-8218]
- 1b907fb feat: Redesigning active-state for experiment, trial, and task [DET-7278] [DET-7801] (#4420)
- 946be9e fix: don't attempt to remove zero checkpoints (#4986)
- 7fdf0e0 fix: .detignore interprets wildcards like .gitignore [DET-7094] (#4998)
- be5471d test: WebUI add test for settings account (#4980)
- 311cb11 feat: pin experiments (#4925)
- b67a11b chore: bump version: 0.19.3-dev0 -> 0.19.4-dev0
- b467929 docs: add release notes for 0.19.3 (#4997)
- d3e8fd5 docs: proto required version to 3.15 to support optionals (#4992)
- 75cf3c5 chore: add streaming to bindings and use it (#4942)
- f6f2f54 feat: add cache control headers to static web assets [DET-7450] (#5005)
- 373df82 chore: add expconf environment.pbs (#4982)
- b392999 feat: add encoding to the file path (#4981)
- b1e91f0 docs: Updates for ROCm support with Slurm (FOUNDENG-128) (#4985)
- 664b58e fix: update icon codes (#5002)
- e701ac0 perf: cache grpcutil.GetUser() result (#4991)
- 37e3e1e feat: user group CLI and RBAC feature flag [DET-7889,DET-8210] (#4637)
- ee0b595 test: master test-intg
TestDeleteCheckpoints
stability. (#4988) - a5607bf feat: Add no permissions warning page[DET-8227] (#4950)
- 067c410 feat: WebUI add view user profile [DET-8228] (#4960)
- 3501823 refactor: rename
usergroup.APIServer
->usergroup.UserGroupAPIServer
(#4933) - d35e24e refactor: bunify
grpcutil.GetUser
. (#4976) - 3a873d2 ci: gke version bump. (#4983)
- f9b9872 chore: reduce and log effective store state changes (#4952)
- 95f82b1 feat: can see users' permissions if view_permissions enabled [DET-8222] (#4984)
- 580e5d9 feat: Frontend uses permission store to clear actions [DET-8215] (#4965)
- a444c01 feat: use task names for interactive task page titles. (#4954)
- d90c0ce fix: address early loading state resolution [DET-8320] (#4978)
- f6d9ea6 fix: make hp search look good on mobile [DET-8321] (#4973)
- 79c60fe fix: correct overflow action buttons [DET-8322] (#4979)
- 2ca4ad2 fix: remove duplicate Admin Guide tile (#4975)
- c15db09 feat: make table row inline (#4962)
- 449c194 feat: allow specifying Fluent Bit container UID/GID on Kubernetes [DET-8012] (#4963)
- f76c7ac chore: recursively unwrap caught exceptions for type checks (#4966)
- 134151e fix: WebUI config download [DET-8323] (#4974)
- 829c30e feat: add batch register and deletion of checkpoints from experiment [DET-8130] (#4931)
- c3ef4bd chore: revert "chore: secure echo with default authentication [DET-7405] [DET-7378] (#4267)" (#4971)
- e5252f1 fix: reduce settings api calls [DET-8307] (#4970)
- 2d0af46 feat: add sorting to the tree and fix css for the tree (#4858)
0.19.3
Changelog
- 17f6d80 chore: bump version: 0.19.3-rc4 -> 0.19.3
- ba4c8fb docs: add release notes for 0.19.3 (#4997)
- 620cf69 chore: bump version: 0.19.3-rc3 -> 0.19.3-rc4
- 1cd716a feat: allow specifying Fluent Bit container UID/GID on Kubernetes [DET-8012] (#4963)
- 041cae9 chore: bump version: 0.19.3-rc2 -> 0.19.3-rc3
- a7364af fix: correct overflow action buttons [DET-8322] (#4979)
- 7557b06 chore: bump version: 0.19.3-rc1 -> 0.19.3-rc2
- 83df8cc fix: remove duplicate Admin Guide tile (#4975)
- f7824e4 fix: WebUI config download [DET-8323] (#4974)
- 11fe52c chore: bump version: 0.19.3-rc0 -> 0.19.3-rc1
- 1af55aa chore: revert "chore: secure echo with default authentication [DET-7405] [DET-7378] (#4267)" (#4971)
- dbe7008 fix: reduce settings api calls [DET-8307] (#4970)
- fa2a825 chore: bump version: 0.19.3-dev0 -> 0.19.3-rc0
- 05d713e chore: lock api state for backward compatibility check
- 0127d7d chore: secure echo with default authentication [DET-7405] [DET-7378] (#4267)
- e2512cf feat: adjust scrollbar color by theme (#4964)
- e1971c8 fix: associate allocation sessions with users (#4949)
- 6c0ea87 chore: fix a typo in py generator (#4938)
- a5ea7e8 chore: add question issue (#4959)
- c87cc1f ci(test-unit): remove debug code (#4947)
- fedee52 test: remove ds test from p2 (#4951)
- 4b565ac ci: run deepspeed on g4dn instances (#4946)
- b2765f0 feat: WebUI 404 not found page [DET-8226] (#4937)
- f1f77c6 refactor: AuthZ for trials [DET-8211] (#4940)
- 522f9f3 fix: allow forking an archived experiment [DET-8277] (#4944)
- d346f3f chore: test apex checkpointing [DET-7886] (#4904)
- 6a3a455 chore: ensure isAuthError can see into wrapped exceptions (#4934)
- c94a91c ci(test-unit): accept only status events (#4941)
- 5363d84 docs: slurm jobs do not require gres (#4911)
- 7c12bd2 docs: update required python to 3.7 (#4939)
- acd2ba9 feat: add programatic download for the config files (#4907)
- 8f1f2f0 ci(test-unit): flail productively (#4936)
- bd2db37 chore: address low hanging security updates (#4872)
- bf61b08 fix: remove prevUser constraint (#4932)
- 948f34a feat: WebUI create user with group info [DET-8221] (#4923)
- a57c909 refactor: AuthZ for experiments [DET-8003] (#4905)
- a93903b feat: helm chart: add OIDC and SCIM options [DPS-204] (#4897)
- ab8e471 test: update yaml file names (#4924)
- 798fca6 docs: fix to hyperlink in release notes (#4895)
- 0fa875c docs: Slurm support updates for 0.19.3 (#4919)
- 99c8f3f chore: fix rebase error (#4922)
- e1632c0 chore: add stream argument to Session._do_request (#4902)
- c3b0fb6 fix: rbac-user-groups merge conflicts and lints.
- f923e79 feat: WebUI group list page [DET-7921, DET-7976] (#4724)
- 710f8f6 fix: rbac-user-groups merge conflicts.
- e9a909d feat: WebUI edit user [DET-7846] (#4680)
- e35fb59 chore: RBAC user groups crud (#4620)
- 1933ef3 feat: migrate patch user logic to grpc server [DET-7909] (#4648)
- e9ab25d feat: pluggable authorization for RBAC. (#4626)
- 12cad9f chore: User Groups SQL (#4519) [DET-7803]
- d551eb4 fix: change /var/cache permissions to mode 775 (#4920)
- 0164be0 fix: GetExperiments error on forked experiment (#4918)
- 9b23d6f ci(test-unit): limit runs to only test-e2e updates (#4915)
- 3dc8651 fix: race condition in agent
container
actor around missingcontainerInfo
. (#4869) - b2caa15 ci(test-unit): fix conditional check syntax (#4913)
- bbf27db ci(test-unit): fix debug line to print payload (#4912)
- c9fdcfa ci(link-artifacts): add initial workflow attempt (#4906)
- 0306d66 chore: resource pool support for PBS (#4884)
- 51355e4 perf: improve
getWorkspaceProjects
api for Quick Search (#4896) - 4ad9c1d chore: change import path in generated bindings (#4900)
- fc1aee2 chore: proto build should fail on first error. (#4802)
- 3f68ac2 fix: re-render issue (#4898)
- 0e3c81e feat: GetExperiments to bun (#4813)
- 479beba feat: DeepSpeed CPU offloading (#4875)
- b85c1b3 chore: replace
PropsWithChildren
with explicit children (#4890) - 3f9aacf chore: migrate python sdk to generated bindings [DET-8005] (#4844)
- a3ad849 chore: bump version: 0.19.2-dev0 -> 0.19.3-dev0
- c339e34 docs: add release notes for 0.19.2 (#4877)
- e066d32 chore: set torch_geometric version in example to fix e2e test. (#4889)
- f6580dd perf: set memory cap to improve memory allocation (#4840)
- 25019fa chore: fix limit 0 for /api/v1/trials/:id/workloads (#4886)
- a5c6f79 feat: experiment checkpoint list [DET-8201] [DET-8129] (#4870)
- 95c5126 feat: allow OrderBy in GetExperimentCheckpoints for SortBy SearcherMetric (#4885)
- a5278b1 feat: create quick search to jump to workspace or project (#4837)
- c0b98db build: enable storybook previews (#4874)
- 116baf9 fix: det e describe with multiple trials (#4863)
- cf31c47 ci: fix flakes in test_max_concurrent_trials (#4865)
- 9f5306d chore: test AMP autocast and gradient scaling [DET-7885] (#4702)
- 0f0f82e chore: some cli cleanup (#4859)
- 5e8d8f2 docs: remove misleading redirect (#4883)
- 07e7650 feat: add security.default_task and openshift host options to helm chart [DPS-204] (#4843)
- 30e3393 feat: add disabled prop to ActionDropdown (DET-7937) (#4867)
- 2f0464f fix: downgrade fluentbit to fix tls.vhost issues (#4871)
- 74dd27f build: avoid double testing via e2e-longrunning (#4850)
- f008dcb chore: add controllable logging support [DET-8025] (#4826)
- 4a7c03f fix: remove workloadCount from trial responses; single-trial view fix (#4857)
- 945cd6a chore: document reasons for scaler.update() (#4845)
- 70c0c66 chore: add authz on moving experiments between projects [DET-7750] (#4806)
- 9e132ed fix: remove subprocess import (#4856)
- 6491115 chore: preserve failed action's error message (#4822)
Docker images
docker pull determinedai/determined-master:0.19.3
docker pull determinedai/determined-master:17f6d80b3
docker pull determinedai/determined-master:17f6d80b349011a29f51210a7634806709f99472
docker pull determinedai/determined-dev:determined-master-17f6d80b3
docker pull determinedai/determined-dev:determined-master-17f6d80b349011a29f51210a7634806709f99472
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.19.3
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:17f6d80b3
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:17f6d80b349011a29f51210a7634806709f99472
0.19.2
Changelog
- 8abc3de chore: bump version: 0.19.2-rc2 -> 0.19.2
- 7db3572 docs: add release notes for 0.19.2 (#4877)
- ea7abb8 chore: bump version: 0.19.2-rc1 -> 0.19.2-rc2
- 02831bb fix: downgrade fluentbit to fix tls.vhost issues (#4871)
- 950f1ce chore: bump version: 0.19.2-rc0 -> 0.19.2-rc1
- 78b9c30 fix: remove workloadCount from trial responses; single-trial view fix (#4857)
- d47ee91 chore: bump version: 0.19.2-dev0 -> 0.19.2-rc0
- 584448b fix: job queue experiment restore (#4797)
- ef83374 fix: set gc policy [DET-8018] (#4812)
- 5e4f50b fix: misc view code bug fixes
- 90ca917 chore: TrialContext is not an interface (#4851)
- 7ae3a38 fix: remove ds example (#4852)
- ec7c8e3 chore: fix random/grid searcher bug with max_concurrent_trials (#4836)
- 27ba9cc fix: enable moving jobs around w/o assuming the full set [DET-8015] (#4766)
- 4ca0fd4 fix: rps should correctly ignore other rps job msgs [DET-8214] (#4848)
- 9a9e05f fix: fit long name (#4825)
- d66d235 fix: remove duplicate loading animation (#4839)
- 80a5507 fix: remove non-model-hub mmdet tests (#4846)
- 29c083e fix: pass user ids for this user ids filter (#4842)
- 5735ea3 fix: do not bypass torch.distributed.launch for single-slot trials (#4838)
- 80735bc fix: handle avgMetrics response on individual trials of multi-trial experiment (#4821)
- 83f5a1a fix: correctly display nested categorical hyperparameters [DET-8074] (#4818)
- 7ffdc3d feat: rolling upgrades support for
det deploy aws
[DET-7853]. (#4829) - b161957 fix: begin standardizing API pagination behavior in CLI. (#4833)
- ab0df53 fix: make allocation saves idempotent (#4695)
- ce376e9 build: set shared web to use xlarge resource class (#4824)
- cf8ebde fix: canceling all experiment trials should cancel experiment (#4759)
- 16e0cc1 ci: fix test-cli-win. (#4834)
- 53316c6 chore: rename a file to prevent API breakages (#4831)
- 345a3b1 ci: Fix publish_helm syntax errors [INFENG-1] (#4819)
- dbccd3c ci: add a checkbox to the PR template (#2969)
- 726fe80 chore: promote Session to a first-class citizen (#4787)
- 2cb0c9b test: webui user management unit test [DET-7968] (#4809)
- 0f4bf86 chore: deprecate mmdetection example in favor of model-hub version (#4816)
- 6815ddf perf: reduce user settings api call (#4790)
- 208afc6 fix: use fluent version 1.9.3 everywhere by default. (#4814)
- 78f40f1 chore: UI fix and improvements (#4747)
- 170000d fix: get trial datapoints from trial comparison/summarization endpoint (#4796)
- 5bda3fc fix: always override protobufAny description in openapi spec (#4811)
- 4bd7c16 fix: reconcile metrics proto, move
det trial describe
to the new API. [DET-7617] (#4746) - 69f5dfb chore: fix ListValue types in swagger spec (#4801)
- 3d459fc ci: fix GHA syntax better [INFENG-1]
- 399148b ci: fix GHA workflow boolean syntax [INFENG-1]
- 6e3bb53 ci: Remove unnecessary quotes [INFENG-1] (#4810)
- 17773b9 chore: share copy to clipboard btn (#4799)
- 7805d59 fix: correct job queue table bugs [DET-8069] (#4804)
- bbbde9c fix: hide
Delete
button if user is not a creator or admin (#4805) - b445352 fix: speed up
det deploy aws
stack updates. (#4793) - 7bcb26e style: update number input error style for dark mode (#4772)
- 7c867e7 refactor: authz interface for projects and workspaces [DET-8002] (#4721)
- e96dca9 fix: grid view on Workspaces and Projects pages show all items [DET-8031] (#4794)
- 441d1c6 fix: job queue pagination (#4756)
- 3191b4c chore:
react-router-dom
partial update part1 (#4788) - cde91c8 feat: Async deleting workspaces and projects (from CLI) [DET-7821] (#4675)
- 003ddd8 fix: directly return object-not-found errors instead of rewrapping them (#4791)
- c0794f5 fix: jupyterLab modal poping issue (#4792)
- efe0fe7 fix: use jupyter icon in navigation side bar (#4786)
- 8cf1517 feat: deepspeed cpu offloading example (#4623)
- 55542b7 fix: tab routing issue in resource pool (#4789)
- 7cd0e51 perf: improve too many user api call (#4763)
- 69c2dfa test: Fix cluster utils cluster_slots() API (#4784)
- 56f6469 fix: use correct experiment list offset when deleting an experiment [DET-7880] (#4754)
- 0dd9e2a chore: Use bindings.v1File instead of ContextItem (#4779)
- e7e0ab2 fix: remove core external dependency from shared (#4782)
- ea3f257 feat: view code UI (#4473)
- 550667b chore: disable positional args in bindings.py classes (#4777)
- 0d5f3eb docs: Add release note for Slurm feature (#4778)
- 047b6ba fix: lint-python ci test (#4774)
- cbc0ba1 refactor: reduce unneeded api calls [DET-7451] (#4771)
- ccf20c5 fix: gpt_neox deepspeed example (#4622)
- f927431 chore: update shared tester git url format (#4773)
- e0ac8e8 fix: label filter in model registry (#4769)
- 99878b7 test: add tests for utils/service (#4749)
- 228744e chore: expose Avatar props through AvatarCard (#4765)
- ad7767c chore: upgrade swagger generator from 2.4.14 to 2.4.27 (#4738)
- 9113e7e Fix a couple more helm action typos [INFENG-1]
- e894408 Fix helm workflow typos / indentation [INFENG-1]
- 4e10f82 ci: add helm repo [infeng 1] (#4725)
- 57842f9 chore: bump version: 0.19.1-dev0 -> 0.19.2-dev0
- 1f5b043 docs: add release notes for 0.19.1 (#4768)
- c478c01 fix: tensorboard metrics step count [DET-8028] (#4761)
- d78c8fc ci: re-enable gke shell logs test fixed d74ef5 (#4760)
- 7d55063 chore: Remove obsolete workloads from Trials API (#4703)
- c6579ef refactor: solidify rm interface [DET-7852, DET-7984] (#4705)
- b303b16 fix: allow changing max_length units in HP Search (#4755)
- 7c98baa ci: remove trent from shared codeowners (#4757)
- d74ef5a ci: shells should generate keys, even with empty 'data' field (#4744)
Docker images
docker pull determinedai/determined-master:0.19.2
docker pull determinedai/determined-master:8abc3decd
docker pull determinedai/determined-master:8abc3decdc2c30813dcf674f19d1beb25eeb51e8
docker pull determinedai/determined-dev:determined-master-8abc3decd
docker pull determinedai/determined-dev:determined-master-8abc3decdc2c30813dcf674f19d1beb25eeb51e8
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.19.2
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:8abc3decd
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:8abc3decdc2c30813dcf674f19d1beb25eeb51e8
0.19.1
Changelog
- 7cc6107 chore: bump version: 0.19.1-rc2 -> 0.19.1
- 5feb1dc docs: add release notes for 0.19.1 (#4768)
- 60b06de chore: bump version: 0.19.1-rc1 -> 0.19.1-rc2
- 6797dba fix: tensorboard metrics step count [DET-8028] (#4761)
- b7ef726 chore: bump version: 0.19.1-rc0 -> 0.19.1-rc1
- 1706a0c fix: allow changing max_length units in HP Search (#4755)
- 086942f chore: bump version: 0.19.1-dev0 -> 0.19.1-rc0
- 14cff26 ci: unversion workflows (#4752)
- f080a63 chore: lock api state for backward compatibility check
- f264281 fix: Write change-password script in /tmp instead of CWD (#4677)
- ebc79c2 fix: python sdk can parse master output again (#4745)
- 2b41576 chore: update docker images names (#4727)
- 286194d fix: deepspeedtrial validation batch size computation (#4743)
- ce78067 chore: [Ant Design] replace old menu with new menu (#4741)
- 6de74a5 docs: add release note for fix for searcher early termination bug (#4739)
- bf6fff3 fix: url-encode description of notebook and tensorboard (#4718)
- 9334945 fix: fix an issue with forbidden api actions causing logout (#4737)
- e695763 style: fix mobile exp header [DET-7975] (#4733)
- 73fe5b6 fix: hardcode pathname instead of using
paths
(#4740) - a071fc6 test: test cases for
shared/utils/routes.ts
[DET-7902] (#4706) - e5fa6fd style: remove styling that forced padding to be 0 (#4734)
- d8d7e25 fix: remove the default theme from initialization (#4698)
- 5adbf42 feat: add spinner to show trial fetching (#4683)
- 464f68c test: add tests for experiment detail page [DET-7979] (#4723)
- a09e81d fix: cursor in modal text field jumps to end of input (#4691)
- 3fcb985 chore: add regex in
InlineEditor
[DET-7518] (#4716) - 5ef83ba chore: share sort utilities [DET-7970] (#4711)
- feabd45 fix: breadcrumb text color (#4720)
- bd542a0 fix: resolve issues around
InlineEditor
[DET-7914] (#4713) - 2ea7946 test: add tests for settings page [DET-7966] (#4717)
- 2724b70 chore: Remove unnecessary imports and fields in proto (#4710)
- eef660f fix: WebUI workspace pagination [DET-7927] (#4700)
- aad8011 refactor: authz provider implementation and authz users basic implementation (#4676)
- 2c575fd fix: record operations at the right places around shutting down (#4719)
- 9ab7dae chore: clear selected item when clear filters (#4714)
- b089329 feat: One-Click Hyperparameter Search [DET-7537] [DET-7538] (#4458)
- 02be458 test: add wait utils test coverage [DET-7959] (#4701)
- bd8664a fix: fix low contrast issue for button styles [DET-7958] (#4692)
- 05800e2 test: update path conditionals for gh workflows (#4708)
- 332270a style: fix doc tile styling (dark mode support and responsive) [DET-7955] (#4709)
- c78b152 fix: mark all 4xx api failures as auth failure (#4690)
- 972954d feat: Connect trial UI to workloads API; pass sort/filter to API (#4407)
- b97198b test: add samlauth tests (#4685)
- 2c26b46 docs: add rest api reference link and rewrite rest api doc (#4688)
- c2968d6 docs: port slurm deployment to oss docs (#4653)
- 3623b0c test: WebUI interaction test for page [DET-7894] (#4689)
- 55aa326 refactor: test cases for
ActionDropdown
(#4699) - 938a486 fix: push-shared target's directory change (#4672)
- da5f7fe fix: keyboard doesnt show for inline editor in mobile [DET-7519] (#4659)
- 0016fd7 fix: word break in description (#4697)
- 6a8856c fix: move some libs in package.json (#4687)
- 20d48be chore: support enum sizes for avatar (#4686)
- 84631ab test: add test cases for
string.ts
(#4679) - 8d2a821 test: create interaction tests for action dropdown [DET-7895] (#4684)
- 5c1679d test: add test coverage for shared error utilities [DET-7900] (#4666)
- ed65d20 ci(lint-python): migrate to gha workflow (#4639)
- 4c3e9f2 test: add test cases for
Image.tsx
(#4667) - aedaa58 chore: bump version: 0.19.0-dev0 -> 0.19.1-dev0
- 63b2dac docs: add release notes for 0.19.0 (#4671)
- 4eeaa51 chore: update live docs script for extension change (#4678)
- 53ed638 test: add test cases for
Icon.tsx
(#4664) - 2a115c4 test: add unit tests for logger class [DET-7901] (#4674)
- 20b682a feat: add new PyTorchCallbacks [DET-7760] (#4500)
- a9f4a87 refactor: remove unused code in model version detail page (#4670)
- 725c74f fix: persist task state update in interactive task view (#4662)
- 3115509 feat: Create user UI [DET-7847] (#4665)
- fac341c fix: Count only active tasks in cluster info board (#4658)
- 9651e0d chore: update codecov badge to reflect web only (#4661)
- 69a8668 fix: comment to gen swagger for model def API [DET-7926] (#4657)
- 17f3926 test: utils/set unit tests [DET-7904] (#4655)
- a42bd97 fix: description overflowing table cell (#4656)
- 65ba5d6 test: add test cases for
AvatarCard
(#4650) - ad70ce8 feat: task specific actions to job overflow menu (#4638)
- dc1ecfb ci(lint-bindings): migrate to gha workflow (#4642)
- 6b341f2 ci(lint-go): migrate to gha workflow (#4636)
- cd5cbec docs: fix a typo in docs for Elasticsearch-backed logging (#4228)
Docker images
docker pull determinedai/determined-master:0.19.1
docker pull determinedai/determined-master:7cc610754
docker pull determinedai/determined-master:7cc610754b2f6828240e07cb222a31da71df4f10
docker pull determinedai/determined-dev:determined-master-7cc610754
docker pull determinedai/determined-dev:determined-master-7cc610754b2f6828240e07cb222a31da71df4f10
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:0.19.1
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:7cc610754
docker pull nvcr.io/isv-ngc-partner/determined/determined-master:7cc610754b2f6828240e07cb222a31da71df4f10