-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Eric contracts #2
base: main
Are you sure you want to change the base?
Commits on Oct 10, 2024
-
[clang] Implement TTP P0522 pack matching for deduced function templa…
…te calls. (llvm#111457) Clang previously missed implementing P0522 pack matching for deduced function template calls. Fixes llvm#111363
Configuration menu - View commit details
-
Copy full SHA for 4dadf42 - Browse repository at this point
Copy the full SHA 4dadf42View commit details -
[mlir][llvmir] Added extra builders for CallInstrinsicOp (llvm#111664)
Extra builders for CallIntrinsicOp. This is inspired by the comment from @antiagainst from [here](llvm#108933 (comment)).
Configuration menu - View commit details
-
Copy full SHA for 741ad3a - Browse repository at this point
Copy the full SHA 741ad3aView commit details -
[Clang] prevent recovery call expression from proceeding with explici…
…t attributes and undeclared templates (llvm#107786) Fixes llvm#107047 Fixes llvm#49093
Configuration menu - View commit details
-
Copy full SHA for 1fa3c85 - Browse repository at this point
Copy the full SHA 1fa3c85View commit details -
[DAGISel] Keep flags when converting FP load/store to integer (llvm#1…
…11679) This DAG combine replaces a floating-point load/store pair which has no other uses with an integer one, but did not copy the memory operand flags to the new instructions, resulting in it dropping the volatile flag. This optimisation is still valid if one or both of the instructions is volatile, so we can copy over the whole MachineMemOperand to generate volatile integer loads and stores where needed.
Configuration menu - View commit details
-
Copy full SHA for 1e49670 - Browse repository at this point
Copy the full SHA 1e49670View commit details -
[clang][bytecode] Check new builtins for integer types (llvm#111801)
These might also be called with vectors, but we don't support that.
Configuration menu - View commit details
-
Copy full SHA for f1eac77 - Browse repository at this point
Copy the full SHA f1eac77View commit details -
[flang][driver] rename flang-new to flang (llvm#110023)
This does a global rename from `flang-new` to `flang`. I also removed/changed any TODOs that I found related to making this change. --------- Co-authored-by: H. Vetinari <[email protected]> Co-authored-by: Andrzej Warzynski <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 06eb10d - Browse repository at this point
Copy the full SHA 06eb10dView commit details -
[MLIR][LLVM] Fix nameless global import to support use before def case (
llvm#111797) This commit fixes a bug in the import of nameless globals. Before this change, the fake symbol names were only generated during the transformation of the definition. This caused issues when the symbol was used before it was defined.
Configuration menu - View commit details
-
Copy full SHA for 54d3cf1 - Browse repository at this point
Copy the full SHA 54d3cf1View commit details -
[InstCombine] Convert @log to @llvm.log if the input is known positiv…
…e. (llvm#111428) Similar to 112aac4, this converts log libcalls to llvm.log.f64 intrinsics if we know they do not set errno, as the input is not zero and not negative. As log will produce errno if the input is 0 (returning -inf) or if the input is negative (returning nan), we also perform the conversion when we have noinf and nonan.
Configuration menu - View commit details
-
Copy full SHA for 5184d76 - Browse repository at this point
Copy the full SHA 5184d76View commit details -
[mlir] add overload
createDIScopeForLLVMFuncOp
function (llvm#111689)follow up work of llvm#106229, add create pass overload function to create pass. --------- Co-authored-by: jingzec <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d124b98 - Browse repository at this point
Copy the full SHA d124b98View commit details -
[clang][HLSL] Add sign intrinsic part 4 (llvm#108396)
- Add handling for unsigned integers to hlsl_elementwise_sign - Use `select` instead of adding dx and spirv intrinsics for unsigned integers as [discussed previously ](llvm#101988 (comment)) fixes llvm#70078 ### Related PRs - llvm#101987 - llvm#101988 - llvm#101989 cc @farzonl @pow2clk @bob80905 @bogner @llvm-beanz
Configuration menu - View commit details
-
Copy full SHA for 99608f1 - Browse repository at this point
Copy the full SHA 99608f1View commit details -
[lldb][docs] Add link to RISC-V tracking issue in Platform Support
Saves me searching for this every time someone asks.
Configuration menu - View commit details
-
Copy full SHA for 993de55 - Browse repository at this point
Copy the full SHA 993de55View commit details -
[X86] Fold EXTRACT_SUBVECTOR(ONEUSE(EXTRACT_SUBVECTOR(V,C1))),C2) - E…
…XTRACT_SUBVECTOR(V,C1+C2) (llvm#111685) Extract from the original source vector whenever possible. This removes a number of dependency bottlenecks and helps a number of shuffle combining cases: either by allowing us to avoid a cross-lane variable shuffle on a slow target by keeping the instruction count below the threshold, or on fast targets make it easier to recognise that the subvectors all came form the same source.
Configuration menu - View commit details
-
Copy full SHA for 3e4ee76 - Browse repository at this point
Copy the full SHA 3e4ee76View commit details -
[lldb] Check for Python 'packaging' module at configuration time (llv…
…m#111747) This module is used in various helper scripts since llvm#93712
Configuration menu - View commit details
-
Copy full SHA for 7890919 - Browse repository at this point
Copy the full SHA 7890919View commit details -
AMDGPU/GlobalISel: Insert m0 initialization before sextload/zextload (l…
…lvm#111720) Fixes missing m0 initialize for pre-gfx9 targets with local extending loads.
Configuration menu - View commit details
-
Copy full SHA for c36f902 - Browse repository at this point
Copy the full SHA c36f902View commit details -
[AMDGPU][NewPM] Fill out AMDGPU addMachineSSAOptimizations (llvm#111658)
Implement the addMachineSSAOptimizations passes for AMDGPU. Porting the other generic passes in this category is WIP.
Configuration menu - View commit details
-
Copy full SHA for 039e6f8 - Browse repository at this point
Copy the full SHA 039e6f8View commit details -
[LTO] Run Argument Promotion before IPSCCP (llvm#111163)
Run ArgumentPromotion before IPSCCP in the LTO pipeline, to expose more constants to be propagated. We also run PostOrderFunctionAttrs to improve the information available to ArgumentPromotion's alias analysis, and SROA to clean up allocas.
Configuration menu - View commit details
-
Copy full SHA for b9754e9 - Browse repository at this point
Copy the full SHA b9754e9View commit details -
Configuration menu - View commit details
-
Copy full SHA for a7a4daa - Browse repository at this point
Copy the full SHA a7a4daaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 003375f - Browse repository at this point
Copy the full SHA 003375fView commit details -
[clang-tidy] Portability Template Virtual Member Function Check (llvm…
…#110099) Introduced a new check that finds cases when an uninstantiated virtual member function in a template class causes cross-compiler incompatibility.
Configuration menu - View commit details
-
Copy full SHA for 6d8e966 - Browse repository at this point
Copy the full SHA 6d8e966View commit details -
Configuration menu - View commit details
-
Copy full SHA for bb34008 - Browse repository at this point
Copy the full SHA bb34008View commit details -
[SimplifyLibCalls] fdim constant fold (llvm#109235)
2nd PR to fix llvm#108695 based on llvm#108702 --------- Signed-off-by: Kushal Pal <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3645c64 - Browse repository at this point
Copy the full SHA 3645c64View commit details -
[lldb] Add early CMake check for 'make' tool (llvm#111531)
Many LLDB's dotest.py based tests require the `make` tool. If it's not found in Path, they fail with an obscure error and show up as `UNRESOLVED`. On Windows, llvm-lit takes care of MSYS based testing tools like cat, printf, etc., but `make` is not part of that. Let's catch the situation early and check for it at configuration time. This error isn't fatal: It should fail the build, but not immediately stop the configuration process. There might be other issues further down the line that can be caught in the same buildbot run.
Configuration menu - View commit details
-
Copy full SHA for 0e91323 - Browse repository at this point
Copy the full SHA 0e91323View commit details -
[lldb][test] elf-memory.test requires LLDB build with Python support (l…
…lvm#111810) Otherwise it fails with "error: Embedded script interpreter unavailable. LLDB was built without scripting language support."
Configuration menu - View commit details
-
Copy full SHA for 99cec7f - Browse repository at this point
Copy the full SHA 99cec7fView commit details -
[mlir][emitc] Fix the error with closing bracket in CppEmitter in swi…
…tchOp (llvm#110269) While working with `emitc::SwitchOp`, it was identified that `mlir-translate` emits **invalid C code** for switch. This commit fixes the issue with the closing bracket in `CppEmitter` within `printOperation` for `emitc::SwitchOp`.
Configuration menu - View commit details
-
Copy full SHA for a1c9dd7 - Browse repository at this point
Copy the full SHA a1c9dd7View commit details -
Revert "Reapply "[AMDGPU][GlobalISel] Fix load/store of pointer vecto…
…rs, buffer.*.pN (llvm#110714)" v2 (llvm#111708)" This reverts commit 4b4a0d4. New test fails on buildbots https://lab.llvm.org/buildbot/#/builders/63/builds/2039 https://lab.llvm.org/buildbot/#/builders/127/builds/1055
Configuration menu - View commit details
-
Copy full SHA for 8a849a2 - Browse repository at this point
Copy the full SHA 8a849a2View commit details -
[clang][x86] Add constexpr support for _mm_cvtsi32_ss/_mm_cvt_si2ss/_…
…mm_cvtsi64_ss SSE1 intrinsics Followup to llvm#111001
Configuration menu - View commit details
-
Copy full SHA for 8306114 - Browse repository at this point
Copy the full SHA 8306114View commit details -
[libc++] Remove nonexistent directory from check-generated-output (ll…
…vm#111746) The libcxx/benchmarks directory was moved to libcxx/test/benchmarks, which is already checked by that grep command.
Configuration menu - View commit details
-
Copy full SHA for ea2b897 - Browse repository at this point
Copy the full SHA ea2b897View commit details -
[Fuchsia][CMake] Set output name for libc++ shared library (llvm#111791)
This is a dependency of llvm#80007.
Configuration menu - View commit details
-
Copy full SHA for b94c763 - Browse repository at this point
Copy the full SHA b94c763View commit details -
[runtimes] Always define cxx_shared, cxx_static & other targets (llvm…
…#80007) This patch always defines the cxx_shared, cxx_static & other top-level targets. However, they are marked as EXCLUDE_FROM_ALL when we don't want to build them. Simply declaring the targets should be of no harm, and it allows other projects to mention these targets regardless of whether they end up being built or not. This patch basically moves the definition of e.g. cxx_shared out of the `if (LIBCXX_ENABLE_SHARED)` and instead marks it as EXCLUDE_FROM_ALL conditionally on whether LIBCXX_ENABLE_SHARED is passed. It then does the same for libunwind and libc++abi targets. I purposefully avoided to reformat the files (which now has inconsistent indentation) because I wanted to keep the diff minimal, and I know this is an area of the code where folks may have downstream diffs. I will re-indent the code separately once this patch lands. This is a reapplication of 79ee034, which was reverted in a353909 because it broke the TSAN and the Fuchsia builds. Resolves llvm#77654 Differential Revision: https://reviews.llvm.org/D134221
Configuration menu - View commit details
-
Copy full SHA for 917ada3 - Browse repository at this point
Copy the full SHA 917ada3View commit details -
[MLIR][GPU-LLVM] Add in-pass signature update for opencl kernels (llv…
…m#105664) Default to Global address space for memrefs that do not have an explicit address space set in the IR. --------- Co-authored-by: Victor Perez <[email protected]> Co-authored-by: Jakub Kuderski <[email protected]> Co-authored-by: Victor Perez <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f8b7a65 - Browse repository at this point
Copy the full SHA f8b7a65View commit details -
[clang][bytecode] Fix temporary lvalue base expression (llvm#111808)
We need to use the MaterializeTemporaryExpr here so the checks in ExprConstant.cpp do the right thing.
Configuration menu - View commit details
-
Copy full SHA for 55d51dd - Browse repository at this point
Copy the full SHA 55d51ddView commit details -
[lldb][test] Use $(STRIP) instead of strip in API tests (Darwin-only …
…change) (llvm#111816) This makes tests more portable. Make variables for LLVM utils are passed to `make` on Darwin as well. Co-authored-by: Vladimir Vereschaka <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b773da0 - Browse repository at this point
Copy the full SHA b773da0View commit details -
[LLVM][DOCS] Add documentation for 'host' and 'Native' options in LLV…
…M_TARGETS_TO_BUILD. (llvm#111382) From llvm#111356
Configuration menu - View commit details
-
Copy full SHA for 36a0d44 - Browse repository at this point
Copy the full SHA 36a0d44View commit details -
VectorCombine: lift one-use limitation in foldExtractedCmps (llvm#110902
Configuration menu - View commit details
-
Copy full SHA for 1f919aa - Browse repository at this point
Copy the full SHA 1f919aaView commit details -
[libc++] __uglify internal member names of iterators in
bitset
(llv……m#111127) [template.bitset.general] indicates that `bitset` shouldn't have member typedef-names `iterator` and `const_iterator`. Currently libc++'s typedef-names are causing ambiguity in name lookup, which isn't conforming. As these iterator types are themselves useful, I think we should just use __uglified member typedef-names for them. Fixes llvm#111125
Configuration menu - View commit details
-
Copy full SHA for 159d694 - Browse repository at this point
Copy the full SHA 159d694View commit details -
[ci] Don't add check-all target when pstl project is enabled (llvm#11…
…1803) Fixes llvm#110265 Adding check-all causes us to run some tests twice if a project specific target like check-clang is also added. check-pstl is an alternative but as far as I can tell, check-all does not include this so we have not been running the tests in CI anyway. When I tried to run check-pstl locally I got a lot of compiler errors but have not found any instructions on how to setup a correct build environment. Even if such instructions exist, it's probably more than we want to do in CI. According to Louis Dionne, the project is probably not active. So if it's ever revived it'll be up to the new contributors to enable testing.
Configuration menu - View commit details
-
Copy full SHA for 9014920 - Browse repository at this point
Copy the full SHA 9014920View commit details -
[flang] correctly deal with bind(c) derived type result ABI (llvm#111678
) Derived type results of BIND(C) function should be returned according the the C ABI for returning the related C struct type. This currently did not happen since the abstract-result pass was forcing the Fortran ABI for all derived type results. use the bind_c attribute that was added on call/func/dispatch in FIR to prevent such rewrite in the abstract result pass, and update the target-rewrite pass to deal with the struct return ABI. So far, the target specific part of the target-rewrite is only implemented for X86-64 according to the "System V Application Binary Interface AMD64 v1", the other targets will hit a TODO, just like for BIND(C), VALUE derived type arguments. This intends to deal with llvm#102113.
Configuration menu - View commit details
-
Copy full SHA for 480e7f0 - Browse repository at this point
Copy the full SHA 480e7f0View commit details -
[Dexter] Remove outdated imp dependency (llvm#111833)
Fixes: llvm#111815 This patch replaces usage of the python `imp` library, which is deprecated since python3.4 and removed in python3.12, with the `importlib` library. As part of this update the repeated find_module+load_module pattern is moved into a utility function, since the importlib equivalent is much more verbose.
Configuration menu - View commit details
-
Copy full SHA for 6779376 - Browse repository at this point
Copy the full SHA 6779376View commit details -
[RISCV] Remove {s,u}int_to_fp custom op action for f16/bf16 (llvm#111471
) It turns out that {s,u}int_to_fp nodes get their operation action from their operand's type, not the result type, so we don't need to set it for fp16 or bf16. vp_{s,u}int_to_fp uses the result type though so we need to keep it. This also means that we can lower int_to_fp for fixed length bf16 vectors already, so this adds tests for that. The cost model test changes are due to BasicTTIImpl's getCastInstrCost not taking into account that int_to_fp needs its legal type swapped. This can be fixed in a later patch, but its worth noting that the affected types in the tests currently crash when lowered anyway (due to them needing split at LMUL > 8)
Configuration menu - View commit details
-
Copy full SHA for a3cd269 - Browse repository at this point
Copy the full SHA a3cd269View commit details -
Revert "[LTO] Run Argument Promotion before IPSCCP" (llvm#111839)
Reverts llvm#111163, as this was merged prematurely.
Configuration menu - View commit details
-
Copy full SHA for 0a0f100 - Browse repository at this point
Copy the full SHA 0a0f100View commit details -
[MCP] Skip invalidating def constant regs during forward propagation (l…
…lvm#111129) Before this patch, redundant COPY couldn't be removed for the following case: ``` %reg1 = COPY %const-reg ... // There is a def of %const-reg %reg2 = COPY killed %reg1 ``` where this can be optimized to: ``` ... // There is a def of %const-reg %reg2 = COPY %const-reg ``` This patch allows for such optimization by not invalidating defined constant registers. This is safe, as architectures like AArch64 and RISCV replace a dead definition of a GPR with a zero constant register for certain instructions.
Configuration menu - View commit details
-
Copy full SHA for dabb0dd - Browse repository at this point
Copy the full SHA dabb0ddView commit details -
[Coroutines] Support for Custom ABIs (llvm#111755)
This change extends the current method for creating ABI object to allow users (plugin libraries) to create custom ABI objects for their needs. This is accomplished by inheriting one of the common ABIs and overriding one or more of the methods to create a custom ABI. To use a custom ABI for a given coroutine the coro.begin.custom.abi intrinsic is used in place of the coro.begin intrinsic. This takes an additional i32 arg that specifies the index of an ABI generator for the custom ABI object in a SmallVector passed to the CoroSplitPass ctor. The detailed changes include: * Add the llvm.coro.begin.custom intrinsic used to specify the index of the custom ABI to use for the given coroutine. * Add constructors to CoroSplit that take a list of generators that create the custom ABI object. * Extend the CreateNewABI function used by CoroSplit to return a unique_ptr to an ABI object. * Add has/getCustomABI methods to CoroBeginInst class. * Add a unittest for a custom ABI. See doc update here: llvm#111781
Configuration menu - View commit details
-
Copy full SHA for 3737a53 - Browse repository at this point
Copy the full SHA 3737a53View commit details -
Configuration menu - View commit details
-
Copy full SHA for 005e601 - Browse repository at this point
Copy the full SHA 005e601View commit details -
[libc] Clean up 'vasprintf' implementation (llvm#111761)
Summary: This had some leftover references to the old namespace and didn't put restrict on it.
Configuration menu - View commit details
-
Copy full SHA for 545e059 - Browse repository at this point
Copy the full SHA 545e059View commit details -
[AMDGPU][test]update error dasm test for update-mc-test-check script (l…
…lvm#111760) The previous error test line is using a 16bit instruction to indicate an error. However this is a poor pick. The 16bit instructions on AMDGPU is under development and thus, some downstream branches are not showing this exact error message. Changing it to another error dasm code.
Configuration menu - View commit details
-
Copy full SHA for e023d02 - Browse repository at this point
Copy the full SHA e023d02View commit details -
[mlir][linalg][nfc] Delete references to args_in/args_out (llvm#111517)
After the refactor in: * ed22913, the `args_in` and `args_out` attributes are no longer used by `linalg.generic`. This patch removes most the remaining references. I've left out BufferDeallocationInternals.md, which doesn't seem maintained anymore and is quite out of sync with other bits of MLIR (e.g. `test.generic` instead of `linalg.generic`).
Configuration menu - View commit details
-
Copy full SHA for f59b0c7 - Browse repository at this point
Copy the full SHA f59b0c7View commit details -
[lldb][test] Use
xcrun -f strip
for API tests on Darwin (llvm#111842)A follow-up for llvm#111816. This is to fix buildbot failure https://lab.llvm.org/staging/#/builders/195/builds/4242. TestSymbolFileJSON.py doesn't pass with llvm-strip on macOS. Apparently, llvm-strip/llvm-objcopy can't clean symbols from Mach-O nlists.
Configuration menu - View commit details
-
Copy full SHA for 058ede0 - Browse repository at this point
Copy the full SHA 058ede0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 77c842f - Browse repository at this point
Copy the full SHA 77c842fView commit details -
Configuration menu - View commit details
-
Copy full SHA for d2a96d1 - Browse repository at this point
Copy the full SHA d2a96d1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 670a461 - Browse repository at this point
Copy the full SHA 670a461View commit details -
Configuration menu - View commit details
-
Copy full SHA for 35bbfbc - Browse repository at this point
Copy the full SHA 35bbfbcView commit details -
Configuration menu - View commit details
-
Copy full SHA for fc467b4 - Browse repository at this point
Copy the full SHA fc467b4View commit details -
[lld-macho] Fix ICF differentiation of safe_thunks relocs (llvm#111811)
In `--icf=safe_thunks` mode, the linker differentiates `keepUnique` functions by creating thunks during a post-processing step after Identical Code Folding (ICF). While this ensures that `keepUnique` functions themselves are not incorrectly merged, it overlooks functions that reference these `keepUnique` symbols. If two functions are identical except for references to different `keepUnique` functions, the current ICF algorithm incorrectly considers them identical because it doesn't account for the future differentiation introduced by thunks. This leads to incorrect deduplication of functions that should remain distinct. To address this issue, we modify the ICF comparison to explicitly check for references to `keepUnique` functions during deduplication. By doing so, functions that reference different `keepUnique` symbols are correctly identified as distinct, preventing erroneous merging and ensuring the correctness of the linked output.
Configuration menu - View commit details
-
Copy full SHA for 97a4324 - Browse repository at this point
Copy the full SHA 97a4324View commit details -
Revert "[flang] correctly deal with bind(c) derived type result ABI" (l…
…lvm#111858) Reverts llvm#111678 Causes ARM failure in test suite. TYPE(C_PTR) result should not regress even if struct ABI no implemented for the target. https://lab.llvm.org/buildbot/#/builders/143/builds/2731 I need to revisit this.
Configuration menu - View commit details
-
Copy full SHA for 4ddc756 - Browse repository at this point
Copy the full SHA 4ddc756View commit details -
[clang] Update string and string_view in lifetimebound tests (llvm#11…
…1737) Removes pragmas like `# 1 "<std>" 1 3` to make line numbers in failing tests more accurate. Use `basic_string_view` instead `string_view` to kick in GSL owner/pointer auto inference.
Configuration menu - View commit details
-
Copy full SHA for a3638f1 - Browse repository at this point
Copy the full SHA a3638f1View commit details -
llvm-reduce: Fix assert on invokes with catchswitch (llvm#111838)
This is the minimal change to avoid the assert. There's an API flaw in invoke instructions where getLandingPad assumes all invoke unwind blocks have landingpads, when some have catchswitch instead. Fixes llvm#111817
Configuration menu - View commit details
-
Copy full SHA for 9839b8c - Browse repository at this point
Copy the full SHA 9839b8cView commit details -
[RISCV] Use RISCVAsmPrinter::EmitToStreamer for EmitHwasanMemaccessSy…
…mbols. (llvm#111792) Add a MCSubtargetInfo& operand so we can control the subtarget for the new calls. The old signature is kept as a wrapper to pass *STI to maintain compatibility. By using EmitToStreamer we are able to compress the instructions when possible.
Configuration menu - View commit details
-
Copy full SHA for c042d8f - Browse repository at this point
Copy the full SHA c042d8fView commit details -
[lldb] Introduce an always-on system log category/channel (llvm#108495)
Add an "always on" log category and channel. Unlike other, existing log channels, it is not exposed to users. The channel is meant to be used sparsely and deliberately for logging high-value information to the system log. We have a similar concept in the downstream Swift fork and this has proven to be extremely valuable. This is especially true on macOS where system log messages are automatically captured as part of a sysdiagnose.
Configuration menu - View commit details
-
Copy full SHA for c77b107 - Browse repository at this point
Copy the full SHA c77b107View commit details -
Configuration menu - View commit details
-
Copy full SHA for f5aec03 - Browse repository at this point
Copy the full SHA f5aec03View commit details -
[Dexter] Set up ComInterface module to be imported correctly (llvm#11…
…1850) Fixes issue added by: llvm#111833 Following the previous commit that changed how Dexter imports modules, the ComInterface module import became broken. This is because it had a different directory structure to other modules, where we want to import single file rather than a dir containing a __init__.py. For this case, an optional extra arg has been added to load_module allowing a filename to be specified, letting us import ComInterface.py directly and fixing the issue.
Configuration menu - View commit details
-
Copy full SHA for 23309d7 - Browse repository at this point
Copy the full SHA 23309d7View commit details -
[mlir][linalg] Introduce transpose semantic to 'linalg.matmul' ops. (l…
…lvm#104783) The main goal of this patch is to extend the semantic of 'linalg.matmul' named op to include per operand transpose semantic while also laying out a way to move ops definition from OpDSL to tablegen. Hence, it is implemented in tablegen. Transpose semantic is as follows. By default 'linalg.matmul' behavior will remain as is. Transpose semantics can be appiled on per input operand by specifying the optional permutation attributes (namely 'permutationA' for 1st input and 'permutationB' for 2nd input) for each operand explicitly as needed. By default, no transpose is mandated for any of the input operand. Example: ``` %val = linalg.matmul ins(%arg0, %arg1 : memref<5x3xf32>, memref<5x7xf32>) outs(%arg2: memref<3x7xf32>) permutationA = [1, 0] permutationB = [0, 1] ```
Configuration menu - View commit details
-
Copy full SHA for 0348373 - Browse repository at this point
Copy the full SHA 0348373View commit details -
[CodeLayout] Do not verify after assigning blocks (llvm#111754)
Rather than invariantly running `F->verify()` when asserts are enabled, run machine IR verification in LIT tests only. Swap `CHECK-PERF` and `CHECK-SIZE` in `code_placement_ext_tsp_large.ll`. Remove `={0,1,true,false}` from flags in tests.
Configuration menu - View commit details
-
Copy full SHA for cb5fbd2 - Browse repository at this point
Copy the full SHA cb5fbd2View commit details -
[Clang] Extend lifetime of temporaries in mem-default-init for P2718R0 (
llvm#86960) Depends on [CWG1815](llvm#108039). Fixes llvm#85613. In [[Clang] Implement P2718R0 "Lifetime extension in range-based for loops"](llvm#76361), we've not implement the lifetime extensions for the temporaries which in `CXXDefaultInitExpr`. As the confirmation in llvm#85613, we should extend lifetime for that. To avoid modifying current CodeGen rules, in a lifetime extension context, the cleanup of `CXXDefaultInitExpr` was ignored. --------- Signed-off-by: yronglin <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 25d9688 - Browse repository at this point
Copy the full SHA 25d9688View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2190ffa - Browse repository at this point
Copy the full SHA 2190ffaView commit details -
[clang][frontend] Support applying the annotate attribute to statemen…
…ts (llvm#111841) By allowing AnnotateAttr to be applied to statements, users can place arbitrary information in the AST for later use. For example, this can be used for HW-targeted language extensions that involve specialized loop annotations.
Configuration menu - View commit details
-
Copy full SHA for 73e74e4 - Browse repository at this point
Copy the full SHA 73e74e4View commit details -
Configuration menu - View commit details
-
Copy full SHA for c04b640 - Browse repository at this point
Copy the full SHA c04b640View commit details -
[mlir][tosa] Change the type of profile option to ListOption (llvm#11…
…1214) In tosa valiation pass, change the type of profile option to ListOption. Now TOSA profiles is turned from hierarchical to composable. Each profile is an independent set, i.e. an target can implement multiple profiles. Set the profile option to none by default, and limit to profiles if requested. The profiles can be specified via command line, e.g. $ mlir-opt ... --tosa-validate="profile=bi,mi" which tells the valiation pass that BI and MI are enabled. Change-Id: I1fb8d0c1b27eccd768349b6eb4234093313efb57
Configuration menu - View commit details
-
Copy full SHA for cc9e7cb - Browse repository at this point
Copy the full SHA cc9e7cbView commit details -
[lldb] Fix a variety of LLDB_LOG format strings
LLVM now triggers an assertion when the format string and arguments don't match. Fix a variety of incorrect format strings I discovered when enabling logging with a debug build.
Configuration menu - View commit details
-
Copy full SHA for f2c5aa9 - Browse repository at this point
Copy the full SHA f2c5aa9View commit details -
[alpha.webkit.UncountedCallArgsChecker] Skip std::forward in tryToFin…
…dPtrOrigin. (llvm#111222) Ignore std::forward when it appears while looking for the pointer origin.
Configuration menu - View commit details
-
Copy full SHA for 0fc3e40 - Browse repository at this point
Copy the full SHA 0fc3e40View commit details -
[alpha.webkit.UncountedCallArgsChecker] Add the support for trivial C…
…XXInheritedCtorInitExpr. (llvm#111198)
Configuration menu - View commit details
-
Copy full SHA for 820bab8 - Browse repository at this point
Copy the full SHA 820bab8View commit details -
isUncountedPtr should take QualType as an argument. (llvm#110213)
Make isUncountedPtr take QualType as an argument instead of Type*. This simplifies some code.
Configuration menu - View commit details
-
Copy full SHA for 39a9141 - Browse repository at this point
Copy the full SHA 39a9141View commit details -
[mlir][debug] Support DICommonBlock. (llvm#111706)
A COMMON block is a named area of memory that holds a collection of variables. Fortran subprograms may map the COMMON block memory area to a list of variables. A common block is represented in LLVM debug by DICommonBlock. This PR adds support for this in MLIR. The changes are mostly mechanical apart from small change to access the DICompileUnit when the scope of the variable is DICommonBlock. --------- Co-authored-by: Tobias Gysi <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 36c34ec - Browse repository at this point
Copy the full SHA 36c34ecView commit details -
[runtimes][NFC] Reindent CMake files (llvm#111821)
This is a purely mechanical commit for fixing the indentation of the runtimes' CMakeLists files after llvm#80007. That PR didn't update the indentation in order to make the diff easier to review and for merge conflicts to be easier to resolve (for downstream changes). This doesn't change any code, it only reindents it.
Configuration menu - View commit details
-
Copy full SHA for 43ba97e - Browse repository at this point
Copy the full SHA 43ba97eView commit details -
[NVPTX] Prefer prmt.b32 over bfi.b32 (llvm#110766)
In [[NVPTX] Improve lowering of v4i8](llvm@cbafb6f) @Artem-B add the ability to lower ISD::BUILD_VECTOR with bfi PTX instructions. @Artem-B did this because: ([source](llvm#67866 (comment))) > Under the hood byte extraction/insertion ends up as BFI/BFE instructions, so we may as well do that in PTX, too. https://godbolt.org/z/Tb3zWbj9b However, the example that @Artem-B linked was targeting sm_52. On modern architectures, ptxas uses prmt.b32. [Example](https://godbolt.org/z/Ye4W1n84o). Thus, remove uses of NVPTXISD::BFI in favor of NVPTXISD::PRMT.
Configuration menu - View commit details
-
Copy full SHA for 3f9998a - Browse repository at this point
Copy the full SHA 3f9998aView commit details -
[flang][runtime] Fix runtime crash after bad recoverable OPEN (llvm#1…
…11454) When an OPEN statement with a unit number fails in a recoverable manner, the runtime needs to delete the ExternalFileUnit instance that was created in the unit map. And we do this too soon -- that instance still holds some of the I/O statement state that will be used by a later call into the runtime for EndIoStatement. Move the code that deletes the unit after a failed but recoverable OPEN into ExternalIoStatementBase::EndIoStatement, and don't do things afterwards that would need the I/O statement state that has been destroyed. Fixes llvm#111404.
Configuration menu - View commit details
-
Copy full SHA for c893e3d - Browse repository at this point
Copy the full SHA c893e3dView commit details -
[flang] Fix references to destroyed objects (llvm#111582)
ProgramTree instances are created as the value of a local variable in the Pre(const parser::ProgramUnit &) member function in name resolution. But references to these ProgramTree instances can persist in SubprogramNameDetails symbol table entries that might survive that function call's lifetime, and lead to trouble later when (e.g.) expression semantics needs to deal with a possible forward reference in a function reference in an expression being processed later in expression checking. So put those ProgramTree instances into a longer-lived linked list within the SemanticsContext. Might fix some weird crashes reported on big-endian targets (AIX & Solaris).
Configuration menu - View commit details
-
Copy full SHA for 4f2b65f - Browse repository at this point
Copy the full SHA 4f2b65fView commit details -
[flang] Minor cleanup (move function into /tools.cpp) (llvm#111587)
The semantics utility GetAllNames has declarations in two header files and a definition that really should be in the common utilities source file. Remove the redudant declaration from resolve-names-utils.h and move code from resolve-names-utils.cpp into Semantics/tools.cpp.
Configuration menu - View commit details
-
Copy full SHA for 2f22656 - Browse repository at this point
Copy the full SHA 2f22656View commit details -
[lldb][libc++] Hide all libc++ implementation details from stacktraces (
llvm#108870) This commit changes the libc++ frame recognizer to hide implementation details of libc++ more aggressively. The applied heuristic is rather straightforward: We consider every function name starting with `__` as an implementation detail. This works pretty neatly for `std::invoke`, `std::function`, `std::sort`, `std::map::emplace` and many others. Also, this should align quite nicely with libc++'s general coding convention of using the `__` for their implementation details, thereby keeping the future maintenance effort low. However, this heuristic by itself does not work in 100% of the cases: E.g., `std::ranges::sort` is not a function, but an object with an overloaded `operator()`, which means that there is no actual call `std::ranges::sort` in the call stack. Instead, there is a `std::ranges::__sort::operator()` call. To make sure that we don't hide this stack frame, we never hide the frame which represents the entry point from user code into libc++ code
Configuration menu - View commit details
-
Copy full SHA for 7e16571 - Browse repository at this point
Copy the full SHA 7e16571View commit details -
[flang][runtime][NFC] Fix header guard typo (llvm#111741)
Header guard was in sync with the filename.
Configuration menu - View commit details
-
Copy full SHA for 7026960 - Browse repository at this point
Copy the full SHA 7026960View commit details -
Configuration menu - View commit details
-
Copy full SHA for 99c8557 - Browse repository at this point
Copy the full SHA 99c8557View commit details -
Configuration menu - View commit details
-
Copy full SHA for 453d373 - Browse repository at this point
Copy the full SHA 453d373View commit details -
Configuration menu - View commit details
-
Copy full SHA for 62b3a4b - Browse repository at this point
Copy the full SHA 62b3a4bView commit details -
Configuration menu - View commit details
-
Copy full SHA for ba530e6 - Browse repository at this point
Copy the full SHA ba530e6View commit details -
[HLSL][DXIL] Implement WaveGetLaneIndex Intrinsic (llvm#111576)
- add additional lowering for directx backend in CGBuiltin.cpp - add directx intrinsic to IntrinsicsDirectX.td - add semantic check of arguments in SemaHLSL.cpp - add mapping to DXIL op in DXIL.td - add testing of semantics in WaveGetLaneIndex-errors.hlsl - add testing of dxil lowering in WaveGetLaneIndex.ll Resolves llvm#70105
Configuration menu - View commit details
-
Copy full SHA for d36cef0 - Browse repository at this point
Copy the full SHA d36cef0View commit details -
[lldb][debugserver][NFC] Simplify macOS thread name fetching. (llvm#1…
…11684) Remove unnecessary `proc_pidinfo` calling.
Configuration menu - View commit details
-
Copy full SHA for b800ff6 - Browse repository at this point
Copy the full SHA b800ff6View commit details -
Switch DirectX Target to use the Itanium ABI (llvm#111632)
To consolidate behavior of function mangling and limit the number of places that ABI changes will need to be made, this switches the DirectX target used for HLSL to use the Itanium ABI from the Microsoft ABI. The Itanium ABI has greater flexibility in decisions regarding mangling of new types of which we have more than a few yet to add. One effect of this will be that linking library shaders compiled with DXC will not be possible with shaders compiled with clang. That isn't considered a terribly interesting use case and one that would likely have been onerous to maintain anyway. This involved adding a function to call all global destructors as the Microsoft ABI had done. This requires a few changes to tests. Most notably the mangling style has changed which accounts for most of the changes. In making those changes, I took the opportunity to harmonize some very similar tests for greater consistency. I also shaved off some unneeded run flags that had probably been copied over from one test to another. Other changes effected by using the new ABI include using different types when manipulating smaller bitfields, eliminating an unnecessary alloca in one instance in this-assignment.hlsl, changing the way static local initialization is guarded, and changing the order of inout parameters getting copied in and out. That last is a subtle change in functionality, but one where there was sufficient inconsistency in the past that standardizing is important, but the particular direction of the standardization is less important for the sake of existing shaders. fixes llvm#110736
Configuration menu - View commit details
-
Copy full SHA for c2063de - Browse repository at this point
Copy the full SHA c2063deView commit details -
[SandboxVec][DAG] Implement PredIterator (llvm#111604)
This patch implements an iterator for iterating over both use-def and mem dependencies of MemDGNodes.
Configuration menu - View commit details
-
Copy full SHA for 747d8f3 - Browse repository at this point
Copy the full SHA 747d8f3View commit details -
[LV] Compute value of escaped induction based on the computed end val…
…ue. (llvm#110576) Update fixupIVUsers to compute the value for escaped inductions using the already computed end value of the induction (EndValue), but subtracting the step. This results in slightly simpler codegen, as we avoid computing the full transformed index at VectorTripCount - 1. PR: llvm#110576
Configuration menu - View commit details
-
Copy full SHA for bb937e2 - Browse repository at this point
Copy the full SHA bb937e2View commit details -
[Coroutines] Improve use of unique_ptr (llvm#111870)
* Replace usage of unique_ptr<>(new ...) -> make_unique<>();
Configuration menu - View commit details
-
Copy full SHA for 1252623 - Browse repository at this point
Copy the full SHA 1252623View commit details -
AMDGPU: Minor improvement and cleanup for waterfall loop generation (l…
…lvm#111886) First, ReadlanePieces should be in the scope of each MachineOperand. It is not correct if we declare in a outer scope without clearing after the use for a MachineOperand. Additionally, we do not need the OrigBB argyment for emitLoadScalarOpsFromVGPRLoop, since MachineFunction (the only use) can be obtained from LoopBB (or BodyBB).
Configuration menu - View commit details
-
Copy full SHA for f6e93b8 - Browse repository at this point
Copy the full SHA f6e93b8View commit details -
[Passes] Remove -enable-infer-alignment-pass flag (llvm#111873)
This flag has been on for a while without any complaints.
Configuration menu - View commit details
-
Copy full SHA for e34d614 - Browse repository at this point
Copy the full SHA e34d614View commit details -
Configuration menu - View commit details
-
Copy full SHA for 07892aa - Browse repository at this point
Copy the full SHA 07892aaView commit details -
Configuration menu - View commit details
-
Copy full SHA for a4916d2 - Browse repository at this point
Copy the full SHA a4916d2View commit details -
[NVPTX] Only run LowerUnreachable when necessary (llvm#109868)
Before CUDA 12.3 `ptxas` did not recognize that the trap instruction terminates a basic block. Instead, it would assume that control flow continued to the next instruction. The next instruction could be in the block that's lexically below it. This would lead to phantom CFG edges being created within ptxas. [NVPTX: Lower unreachable to exit to allow ptxas to accurately reconstruct the CFG.](llvm@1ee4d88) added the LowerUnreachable pass to NVPTX to work around this. Several other WAR patches followed. This bug in `ptxas` was fixed in CUDA 12.3 and is thus impossible to encounter when targeting PTX ISA v8.3+ This commit reverts the WARs for the `ptxas` bug when targeting PTX ISA v8.3+ CC @maleadt
Configuration menu - View commit details
-
Copy full SHA for d832a1c - Browse repository at this point
Copy the full SHA d832a1cView commit details -
[Coroutines] Documentation for custom ABIs (llvm#111781)
Update the llvm/docs/Coroutines.rst docs to include a full description of Custom ABI objects. This documentation describes the how ABI objects allow users (plugin libraries) to create custom ABI objects for their needs.
Configuration menu - View commit details
-
Copy full SHA for 29e192a - Browse repository at this point
Copy the full SHA 29e192aView commit details -
[lldb] Implement basic support for reverse-continue (llvm#99736)
This commit only adds support for the `SBProcess::ReverseContinue()` API. A user-accessible command for this will follow in a later commit. This feature depends on a gdbserver implementation (e.g. `rr`) providing support for the `bc` and `bs` packets. `lldb-server` does not support those packets, and there is no plan to change that. So, for testing purposes, `lldbreverse.py` wraps `lldb-server` with a Python implementation of *very limited* record-and-replay functionality for use by *tests only*. The majority of this PR is test infrastructure (about 700 of the 950 lines added).
Configuration menu - View commit details
-
Copy full SHA for d5e1de6 - Browse repository at this point
Copy the full SHA d5e1de6View commit details -
Configuration menu - View commit details
-
Copy full SHA for f0ed31c - Browse repository at this point
Copy the full SHA f0ed31cView commit details -
[NFC][sanitizer] Reopen '/proc/%d/task' instead of seek
NFC because I am not aware of any particular issue from seek, but reopen looks less error prone. Pull Request: llvm#111899
Configuration menu - View commit details
-
Copy full SHA for 942fefe - Browse repository at this point
Copy the full SHA 942fefeView commit details -
[lldb] Return an llvm::Error from GetFrameBaseValue (llvm#111882)
This fixes the following assertion: "Cannot create Expected<T> from Error success value." The problem was that GetFrameBaseValue return false without updating the Status argument. This patch eliminates the opportunity for mistakes by returning an llvm:Error.
Configuration menu - View commit details
-
Copy full SHA for 69b0b7e - Browse repository at this point
Copy the full SHA 69b0b7eView commit details -
[lldb] Add include for SBLanguages in lldb-enumerations (llvm#111907)
This adds an include for SBLanguages.h in lldb-enumerations.h so that files that need this enum do not have to explicitly include SBLanguages.
Configuration menu - View commit details
-
Copy full SHA for b355426 - Browse repository at this point
Copy the full SHA b355426View commit details -
[sanitizer] VReport BeforeFork/AfterFork (llvm#111900)
Forks are common suspects for unusual sanitizer behavior. It can be handy to see them without rebuild.
Configuration menu - View commit details
-
Copy full SHA for 36bd9ae - Browse repository at this point
Copy the full SHA 36bd9aeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 86f78c0 - Browse repository at this point
Copy the full SHA 86f78c0View commit details -
Configuration menu - View commit details
-
Copy full SHA for b77fdf5 - Browse repository at this point
Copy the full SHA b77fdf5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 69c0067 - Browse repository at this point
Copy the full SHA 69c0067View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1954869 - Browse repository at this point
Copy the full SHA 1954869View commit details -
[SLP]Track repeated reduced value as it might be vectorized
Need to track changes with the repeated reduced value, since it might be vectorized in the next attempt for reduction vectorization, to correctly generate the code and avoid compiler crash. Fixes llvm#111887
Configuration menu - View commit details
-
Copy full SHA for 4b5018d - Browse repository at this point
Copy the full SHA 4b5018dView commit details -
[test] env -u is not supported on AIX, use
unset
insteadWael Yehia committedOct 10, 2024 Configuration menu - View commit details
-
Copy full SHA for 16ef893 - Browse repository at this point
Copy the full SHA 16ef893View commit details -
Revert "[lldb] Add include for SBLanguages in lldb-enumerations (llvm…
…#111907)" Temporarily Revert until Chelsea can look at this. With a clean build, SBLanguages.h won't be generated in the build directory at the point when it is included by lldb-enumerations when compiling e.g. Broadcaster.cpp. On a clean build (no pre-existing build directory), the dependency ordering is not explicitly stated so the build will fail. An incremental build will succeed. This reverts commit b355426.
Configuration menu - View commit details
-
Copy full SHA for c99b365 - Browse repository at this point
Copy the full SHA c99b365View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5deadc6 - Browse repository at this point
Copy the full SHA 5deadc6View commit details -
[sanitizer] VReport thread status for failed PTRACE_ATTACH (llvm#111901)
Such threads can cause false leak reports, but often it's hard to diagnose the reason of failed PTRACE_ATTACH. Maybe we can find a clue from `/proc/*/task/*/status`
Configuration menu - View commit details
-
Copy full SHA for af7fa27 - Browse repository at this point
Copy the full SHA af7fa27View commit details -
[ThinLTO] Do not duplicate import a function that is actually defined…
… in the current module (llvm#110064) Doing so could cause a bug where the linker tries to remap a function "reimported" from the current module when materializing it, causing a lookup assert in the type mappings.
Configuration menu - View commit details
-
Copy full SHA for 48545a9 - Browse repository at this point
Copy the full SHA 48545a9View commit details -
Revert "[lldb] SetErrorStringWithFormatv -> FromErrorStringWithFormat…
…v (NFC)" This reverts commit b77fdf5.
Configuration menu - View commit details
-
Copy full SHA for f02252e - Browse repository at this point
Copy the full SHA f02252eView commit details -
Revert "[lldb] Implement basic support for reverse-continue (llvm#99736…
…)" This reverts commit d5e1de6.
Configuration menu - View commit details
-
Copy full SHA for 2ff4c25 - Browse repository at this point
Copy the full SHA 2ff4c25View commit details -
[LLDB][Minidump] Have Minidumps save off and properly read TLS data (l…
…lvm#109477) This patch adds the support to `Process.cpp` to automatically save off TLS sections, either via loading the memory region for the module, or via reading `fs_base` via generic register. Then when Minidumps are loaded, we now specify we want the dynamic loader to be the `POSIXDYLD` so we can leverage the same TLS accessor code as `ProcessELFCore`. Being able to access TLS Data is an important step for LLDB generated minidumps to have feature parity with ELF Core dumps.
Configuration menu - View commit details
-
Copy full SHA for e9c8f75 - Browse repository at this point
Copy the full SHA e9c8f75View commit details -
[lldb] Implement basic support for reverse-continue (llvm#99736)
This commit only adds support for the `SBProcess::ReverseContinue()` API. A user-accessible command for this will follow in a later commit. This feature depends on a gdbserver implementation (e.g. `rr`) providing support for the `bc` and `bs` packets. `lldb-server` does not support those packets, and there is no plan to change that. So, for testing purposes, `lldbreverse.py` wraps `lldb-server` with a Python implementation of *very limited* record-and-replay functionality for use by *tests only*. The majority of this PR is test infrastructure (about 700 of the 950 lines added).
Configuration menu - View commit details
-
Copy full SHA for 4f29756 - Browse repository at this point
Copy the full SHA 4f29756View commit details -
Configuration menu - View commit details
-
Copy full SHA for fae7d68 - Browse repository at this point
Copy the full SHA fae7d68View commit details -
[lldb] skip ReverseContinue tests on Darwin
This uses lldb-server in gdbserver mode, which requires a ProcessNative plugin. Darwin does not have a ProcessNative plugin; it uses debugserver instead of lldb-server. Skip these tests.
Configuration menu - View commit details
-
Copy full SHA for c686eeb - Browse repository at this point
Copy the full SHA c686eebView commit details -
Revert "[ThinLTO] Do not duplicate import a function that is actually…
… defined in the current module" (llvm#111919) Reverts llvm#110064
Configuration menu - View commit details
-
Copy full SHA for 1bf271d - Browse repository at this point
Copy the full SHA 1bf271dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 45cc743 - Browse repository at this point
Copy the full SHA 45cc743View commit details -
[ORC][ELF] Remove the ExecutionSession& argument to ELFNixPlatform::C…
…reate. We can get a reference to the ExecutionSession from the ObjectLinkingLayer argument, so there's no need to pass it in separately.
Configuration menu - View commit details
-
Copy full SHA for cc20dd2 - Browse repository at this point
Copy the full SHA cc20dd2View commit details -
Revert "[lldb] skip ReverseContinue tests on Darwin"
This reverts commit c686eeb.
Configuration menu - View commit details
-
Copy full SHA for 4f32077 - Browse repository at this point
Copy the full SHA 4f32077View commit details -
Revert "[lldb] SetErrorStringWithFormatv -> FromErrorStringWithFormat…
…v (NFC)" This reverts commit fae7d68.
Configuration menu - View commit details
-
Copy full SHA for a28e7ce - Browse repository at this point
Copy the full SHA a28e7ceView commit details -
Revert "[lldb] Implement basic support for reverse-continue (llvm#99736…
…)" Reverting this again; I added a commit which added @skipIfDarwin markers to the TestReverseContinueBreakpoints.py and TestReverseContinueNotSupported.py API tests, which use lldb-server in gdbserver mode which does not work on Darwin. But the aarch64 ubuntu bot reported a failure on TestReverseContinueBreakpoints.py, https://lab.llvm.org/buildbot/#/builders/59/builds/6397 File "/home/tcwg-buildbot/worker/lldb-aarch64-ubuntu/llvm-project/lldb/test/API/functionalities/reverse-execution/TestReverseContinueBreakpoints.py", line 63, in test_reverse_continue_skip_breakpoint self.reverse_continue_skip_breakpoint_internal(async_mode=False) File "/home/tcwg-buildbot/worker/lldb-aarch64-ubuntu/llvm-project/lldb/test/API/functionalities/reverse-execution/TestReverseContinueBreakpoints.py", line 81, in reverse_continue_skip_breakpoint_internal self.expect( File "/home/tcwg-buildbot/worker/lldb-aarch64-ubuntu/llvm-project/lldb/packages/Python/lldbsuite/test/lldbtest.py", line 2372, in expect self.runCmd( File "/home/tcwg-buildbot/worker/lldb-aarch64-ubuntu/llvm-project/lldb/packages/Python/lldbsuite/test/lldbtest.py", line 1002, in runCmd self.assertTrue(self.res.Succeeded(), msg + output) AssertionError: False is not true : Process should be stopped due to history boundary Error output: error: Process must be launched. This reverts commit 4f29756.
Configuration menu - View commit details
-
Copy full SHA for 3bef742 - Browse repository at this point
Copy the full SHA 3bef742View commit details -
[HLSL] Implement the
degrees
intrinsic (llvm#111209)- add degrees builtin - link degrees api in hlsl_intrinsics.h - add degrees intrinsic to IntrinsicsDirectX.td - add degrees intrinsic to IntrinsicsSPIRV.td - add lowering from clang builtin to dx/spv intrinsics in CGBuiltin.cpp - add semantic checks to SemaHLSL.cpp - add expansion of directx intrinsic to llvm fmul for DirectX in DXILIntrinsicExpansion.cpp - add mapping to spir-v intrinsic in SPIRVInstructionSelector.cpp - add test coverage: - degrees.hlsl -> check hlsl lowering to dx/spv degrees intrinsics - degrees-errors.hlsl/half-float-only-errors -> check semantic warnings - hlsl-intrinsics/degrees.ll -> check lowering of spir-v degrees intrinsic to SPIR-V backend - DirectX/degrees.ll -> check expansion and scalarization of directx degrees intrinsic to fmul Resolves llvm#99104
Configuration menu - View commit details
-
Copy full SHA for 2647505 - Browse repository at this point
Copy the full SHA 2647505View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6640dac - Browse repository at this point
Copy the full SHA 6640dacView commit details -
[lld][elf] Warn if '*' pattern is used multiple times in version scri…
…pts (llvm#102669) If this pattern is used more than once in version script(s), only one will have an effect, so it's probably a user error and can be diagnosed.
Configuration menu - View commit details
-
Copy full SHA for 1037f57 - Browse repository at this point
Copy the full SHA 1037f57View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0add174 - Browse repository at this point
Copy the full SHA 0add174View commit details
Commits on Oct 11, 2024
-
[NFC][clang] Fix typo in ReleaseNotes (llvm#111930)
Fix a typo in ReleaseNotes that introduced by llvm#86960. Signed-off-by: yronglin <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 774c953 - Browse repository at this point
Copy the full SHA 774c953View commit details -
[asan] Prevent printing invalid parent thread (llvm#111916)
By default reuse can happend only after `UINT32_MAX` threads, so it's almost NFC.
Configuration menu - View commit details
-
Copy full SHA for 9c81a24 - Browse repository at this point
Copy the full SHA 9c81a24View commit details -
AArch64: Select FCANONICALIZE (llvm#104429)
FMINNM/FMAXNM instructions of AArch64 follow IEEE754-2008. We can use them to canonicalize a floating point number. And FMINNUM_IEEE/FMAXNUM_IEEE is used by something like expanding FMINIMUMNUM/FMAXIMUMNUM, so let's define them. --------- Co-authored-by: Your Name <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 72fb379 - Browse repository at this point
Copy the full SHA 72fb379View commit details -
[NFC][mlir][spirv] Fix syntax warnings in gen_spirv_dialect.py (llvm#…
…111775) In the context of regular expressions, Python (used to) gracefully ignore the escape behavior of `\` in some contexts, e.g. for representing the regular expression `\w+`. However in newer versions of Python this now gives a warning in the form ``` SyntaxWarning: invalid escape sequence '\w' ``` Fix by explicitly using raw strings instead.
Configuration menu - View commit details
-
Copy full SHA for 6c398ab - Browse repository at this point
Copy the full SHA 6c398abView commit details -
[CodeGenPrepare] Convert
ctpop(X) ==/!= 1
intoctpop(X) u</u> 2/1
(……llvm#111284) Some targets have better codegen for `ctpop(X) u< 2` than `ctpop(X) == 1`. After llvm#100899, we set the range of ctpop's return value to indicate the argument/result is non-zero. This patch converts `ctpop(X) ==/!= 1` into `ctpop(X) u</u> 2/1` in CGP to fix llvm#95255.
Configuration menu - View commit details
-
Copy full SHA for e3894f5 - Browse repository at this point
Copy the full SHA e3894f5View commit details -
[libc] Temporarily disable strerror test on NVPTX
Summary: This is failing on the NVPTX buildbot, https://lab.llvm.org/buildbot/#/builders/69/builds/6997/. I cannot reproduce it locally so I'm disabling it temporarily so the bot is green.
Configuration menu - View commit details
-
Copy full SHA for cbfcea1 - Browse repository at this point
Copy the full SHA cbfcea1View commit details -
[lldb-dap] Implement value locations for function pointers (llvm#104589)
This commit adds `valueLocationReference` to function pointers and function references. Thereby, users can navigate directly to the pointed-to function from within the "variables" pane. In general, it would be useful to also a add similar location references also to member function pointers, `std::source_location`, `std::function`, and many more. Doing so would require extending the formatters to provide such a source code location. There were two RFCs about this a while ago: https://discourse.llvm.org/t/rfc-extending-formatters-with-a-source-code-reference/68375 https://discourse.llvm.org/t/rfc-sbvalue-metadata-provider/68377/26 However, both RFCs ended without a conclusion. As such, this commit now implements the lowest-hanging fruit, i.e. function pointers. If people find it useful, I will revive the RFC afterwards.
Configuration menu - View commit details
-
Copy full SHA for 9f8ae78 - Browse repository at this point
Copy the full SHA 9f8ae78View commit details -
[X86][StrictFP] Combine fcmp + select to fmin/fmax for some predicates (
llvm#109512) X86 maxss/minss etc. instructions won't turn SNaN to QNaN, so we can combine fcmp + select to them for some predicates.
Configuration menu - View commit details
-
Copy full SHA for 9882b35 - Browse repository at this point
Copy the full SHA 9882b35View commit details -
[Clang] Instantiate Typedefs referenced by type alias deduction guides (
llvm#111804) TypedefNameDecl referenced by a synthesized CTAD guide for type aliases was not transformed previously, resulting in a substitution failure in BuildDeductionGuideForTypeAlias() when substituting into the right-hand-side deduction guide. This patch fixes it in the way we have been doing since https://reviews.llvm.org/D80743. We transform all the function parameters, parenting referenced TypedefNameDecls with the CXXDeductionGuideDecl. Then we instantiate these declarations in FindInstantiatedDecl() as we build up the eventual deduction guide, using the mechanism introduced in D80743 Fixes llvm#111508
Configuration menu - View commit details
-
Copy full SHA for 0bc02b9 - Browse repository at this point
Copy the full SHA 0bc02b9View commit details -
Revert "[CodeGenPrepare] Convert
ctpop(X) ==/!= 1
into `ctpop(X) u<……/u> 2/1`" (llvm#111932) Reverts llvm#111284 to fix clang stage2 builds. Investigating... Failed buildbots: https://lab.llvm.org/buildbot/#/builders/76/builds/3576 https://lab.llvm.org/buildbot/#/builders/168/builds/4308 https://lab.llvm.org/buildbot/#/builders/127/builds/1087
Configuration menu - View commit details
-
Copy full SHA for ec3e0a5 - Browse repository at this point
Copy the full SHA ec3e0a5View commit details -
I'm about to post a PR in this area.
Configuration menu - View commit details
-
Copy full SHA for 126ed16 - Browse repository at this point
Copy the full SHA 126ed16View commit details -
[libc][math] Improve performance of double precision trig functions. (l…
…lvm#111793) - Improve the accuracy of fast pass' range reduction. - Provide tighter error estimations. - Reduce the table size when `LIBC_MATH_SMALL_TABLES` flag is set.
Configuration menu - View commit details
-
Copy full SHA for 51e9430 - Browse repository at this point
Copy the full SHA 51e9430View commit details -
Configuration menu - View commit details
-
Copy full SHA for e01ae39 - Browse repository at this point
Copy the full SHA e01ae39View commit details -
[sanitizer] Fix ThreadLister::IsAlive (llvm#111942)
'status_path_' must include `tid`. Regression from llvm#111909.
Configuration menu - View commit details
-
Copy full SHA for 59b2945 - Browse repository at this point
Copy the full SHA 59b2945View commit details -
[clang][bytecode] Return an lvalue path for dummy pointers (llvm#111862)
Not doing this is wrong in general and we need to reject expressions where it would matter differently.
Configuration menu - View commit details
-
Copy full SHA for 36b0707 - Browse repository at this point
Copy the full SHA 36b0707View commit details -
[NFC][sanitizer] Check suspended threads outside `ThreadSuspender::Su…
…spendThread` (llvm#111943) Allows to distinguish failure from stopped threads.
Configuration menu - View commit details
-
Copy full SHA for 374886a - Browse repository at this point
Copy the full SHA 374886aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 36639af - Browse repository at this point
Copy the full SHA 36639afView commit details -
[NFC][asan] Cleanup AsanThreadIdAndName ctor/init (llvm#111923)
Co-authored-by: YunQiang Su <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e556f07 - Browse repository at this point
Copy the full SHA e556f07View commit details -
[asan] Print
unique_id
instead oftid
(llvm#111925)Before the first reuse, after 2^32 threads they are equal.
Configuration menu - View commit details
-
Copy full SHA for df4c913 - Browse repository at this point
Copy the full SHA df4c913View commit details -
[NFC][sanitizer] Simplify GetThreadLocked
Now we can pass `invalid tid`.
Configuration menu - View commit details
-
Copy full SHA for 3cb4d20 - Browse repository at this point
Copy the full SHA 3cb4d20View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf81bd8 - Browse repository at this point
Copy the full SHA bf81bd8View commit details -
[IR] Allow MDString in operand bundles (llvm#110805)
This change implements support of metadata strings in operand bundle values. It makes possible calls like: call void @some_func(i32 %x) [ "foo"(i32 42, metadata !"abc") ] It requires some extension of the bitcode serialization. As SSA values and metadata are stored in different tables, there must be a way to distinguish them during deserialization. It is implemented by putting a special marker before the metadata index. The marker cannot be treated as a reference to any SSA value, so it unambiguously identifies metadata. It allows extending the bitcode serialization without breaking compatibility. Metadata as operand bundle values are intended to be used in floating-point function calls. They would represent the same information as now is passed by the constrained intrinsic arguments.
Configuration menu - View commit details
-
Copy full SHA for 15de239 - Browse repository at this point
Copy the full SHA 15de239View commit details -
[ELF] Move InputSectionBase::file to SectionBase
... and add getCtx (file->ctx). This allows InputSectionBase and OutputSection to access ctx without taking an extra function argument.
Configuration menu - View commit details
-
Copy full SHA for c22588c - Browse repository at this point
Copy the full SHA c22588cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e018f55 - Browse repository at this point
Copy the full SHA e018f55View commit details -
Configuration menu - View commit details
-
Copy full SHA for 25cda9e - Browse repository at this point
Copy the full SHA 25cda9eView commit details -
[clang][bytecode] Check number of addcarry/subborrow args (llvm#111952)
Apparently this can fail as well.
Configuration menu - View commit details
-
Copy full SHA for 1fd79f1 - Browse repository at this point
Copy the full SHA 1fd79f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for d91c103 - Browse repository at this point
Copy the full SHA d91c103View commit details -
[AMDGPU][SplitModule] Cleanup CallsExternal Handling (llvm#106528)
- Don't treat inline ASM as indirect calls - Remove call to alias testing, which was broken (only working by pure luck right now) and isn't needed anyway. GlobalOpt should take care of them for us.
Configuration menu - View commit details
-
Copy full SHA for d656b20 - Browse repository at this point
Copy the full SHA d656b20View commit details -
Configuration menu - View commit details
-
Copy full SHA for 81bd712 - Browse repository at this point
Copy the full SHA 81bd712View commit details -
[AMDGPU] Enable unaligned scratch accesses (llvm#110219)
This allows us to emit wide generic and scratch memory accesses when we do not have alignment information. In cases where accesses happen to be properly aligned or where generic accesses do not go to scratch memory, this improves performance of the generated code by a factor of up to 16x and reduces code size, especially when lowering memcpy and memmove intrinsics. Also: Make the use of the FeatureUnalignedScratchAccess feature more consistent: FeatureUnalignedScratchAccess and EnableFlatScratch are now orthogonal, whereas, before, code assumed that the latter implies the former at some places. Part of SWDEV-455845.
Configuration menu - View commit details
-
Copy full SHA for 173c682 - Browse repository at this point
Copy the full SHA 173c682View commit details -
[C++20][Modules] Allow import for a header unit after #pragma (llvm#1…
…11662) Summary: `#pragma` and headers that finish with them shouldn't prevent `import "header_unit.h"` syntax. Test Plan: check-clang
Configuration menu - View commit details
-
Copy full SHA for 65780f4 - Browse repository at this point
Copy the full SHA 65780f4View commit details -
[clang][bytecode] Use PredefinedExpr as base for its variable (llvm#1…
…11956) This fixes the error message generated.
Configuration menu - View commit details
-
Copy full SHA for ff04bb8 - Browse repository at this point
Copy the full SHA ff04bb8View commit details -
[mlir][sparse][test] Adjust tests for
LowerSparseOpsToForeach
(llvm……#110976) This PR relocates the tests added in llvm#109435 to a new file named `no_lowering.mlir` and adds some new tests.
Configuration menu - View commit details
-
Copy full SHA for bff2b8c - Browse repository at this point
Copy the full SHA bff2b8cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8bb12ca - Browse repository at this point
Copy the full SHA 8bb12caView commit details -
[mlir][linalg] Fix for bias handling for Winograd (llvm#110331)
PR makes winograd.output_transform op a destination style op and fixes handing of a pre-existing data in its output argument (i.e. possibly pre-initialized with bias, which was discarded before). --------- Signed-off-by: Dmitriy Smirnov <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for bb4696c - Browse repository at this point
Copy the full SHA bb4696cView commit details -
[lldb] Only send "posix" error codes through the gdb-remote protocol (l…
…lvm#108170) The other side has no way of telling which namespace do these codes belong to, so mashing them all together is not very helpful. I'm mainly doing this to simplify some code in a pending patch <https://github.com/llvm/llvm-project/pull/106774/files#r1752628604>, and I've picked the posix error category semi-randomly. If we wanted to be serious about assigning meaning to these error codes, we should create a special error category for "gdb errors".
Configuration menu - View commit details
-
Copy full SHA for ebeb56a - Browse repository at this point
Copy the full SHA ebeb56aView commit details -
[lldb][test] Fixed the test
no_unique_address-with-bitfields
(llvm#……111902) Fixed the error `unable to create target: 'No available targets are compatible with triple "x86_64-apple-macosx10.4.0"'` running `clang --target=x86_64-apple-macosx -c -gdwarf -o %t %s`.
Configuration menu - View commit details
-
Copy full SHA for b222f31 - Browse repository at this point
Copy the full SHA b222f31View commit details -
[LoopVectorize] Use predicated version of getSmallConstantMaxTripCount (
llvm#109928) There are a number of places where we call getSmallConstantMaxTripCount without passing a vector of predicates: getSmallBestKnownTC isIndvarOverflowCheckKnownFalse computeMaxVF isMoreProfitable I've changed all of these to now pass in a predicate vector so that we get the benefit of making better vectorisation choices when we know the max trip count for loops that require SCEV predicate checks. I've tried to add tests that cover all the cases affected by these changes.
Configuration menu - View commit details
-
Copy full SHA for 72f339d - Browse repository at this point
Copy the full SHA 72f339dView commit details -
Revert "[mlir][linalg] Introduce transpose semantic to 'linalg.matmul…
…' ops. (llvm#104783)" This reverts commit 0348373 and 99c8557, which is a fix-up on top of the former. I'm reverting because this commit broke two tests: mlir/test/python/integration/dialects/linalg/opsrun.py mlir/test/python/integration/dialects/transform.py See https://lab.llvm.org/buildbot/#/builders/138/builds/4872 I'm not familiar with the tests, so I'm leaving it to the original author to either remove or adapt the broken tests, as discussed here: llvm#104783 (comment)
Configuration menu - View commit details
-
Copy full SHA for 1276ce9 - Browse repository at this point
Copy the full SHA 1276ce9View commit details -
[XRay] Add support for instrumentation of DSOs on x86_64 (llvm#90959)
This PR introduces shared library (DSO) support for XRay based on a revised version of the implementation outlined in [this RFC](https://discourse.llvm.org/t/rfc-upstreaming-dso-instrumentation-support-for-xray/73000). The feature enables the patching and handling of events from DSOs, supporting both libraries linked at startup or explicitly loaded, e.g. via `dlopen`. This patch adds the following: - The `-fxray-shared` flag to enable the feature (turned off by default) - A small runtime library that is linked into every instrumented DSO, providing position-independent trampolines and code to register with the main XRay runtime - Changes to the XRay runtime to support management and patching of multiple objects These changes are fully backward compatible, i.e. running without instrumented DSOs will produce identical traces (in terms of recorded function IDs) to the previous implementation. Due to my limited ability to test on other architectures, this feature is only implemented and tested with x86_64. Extending support to other architectures is fairly straightforward, requiring only a position-independent implementation of the architecture-specific trampoline implementation (see `compiler-rt/lib/xray/xray_trampoline_x86_64.S` for reference). This patch does not include any functionality to resolve function IDs from DSOs for the provided logging/tracing modes. These modes still work and will record calls from DSOs, but symbol resolution for these functions in not available. Getting this to work properly requires recording information about the loaded DSOs and should IMO be discussed in a separate RFC, as there are mulitple feasible approaches. @petrhosek @jplehr
Configuration menu - View commit details
-
Copy full SHA for a440203 - Browse repository at this point
Copy the full SHA a440203View commit details -
[clang][ExprConstant] Remove an outdated TODO comment (llvm#111959)
Seems like passing the quantities directly seems to work fine.
Configuration menu - View commit details
-
Copy full SHA for 42ec740 - Browse repository at this point
Copy the full SHA 42ec740View commit details -
AMDGPU/GlobalISel: Fix inst-selection of ballot (llvm#109986)
Both input and output of ballot are lane-masks: result is lane-mask with 'S32/S64 LLT and SGPR bank' input is lane-mask with 'S1 LLT and VCC reg bank'. Ballot copies bits from input lane-mask for all active lanes and puts 0 for inactive lanes. GlobalISel did not set 0 in result for inactive lanes for non-constant input.
Configuration menu - View commit details
-
Copy full SHA for 7b0d56b - Browse repository at this point
Copy the full SHA 7b0d56bView commit details -
[mlir][EmitC] Fail on memrefs with 0 dims in type conversion (llvm#11…
…1965) This let's the type conversion fail instead of generating invalid array types.
Configuration menu - View commit details
-
Copy full SHA for 7771429 - Browse repository at this point
Copy the full SHA 7771429View commit details -
Configuration menu - View commit details
-
Copy full SHA for 80c15c4 - Browse repository at this point
Copy the full SHA 80c15c4View commit details -
[OpenMP][Flang] Enable alias analysis inside omp target region (llvm#…
…111670) At present, alias analysis does not work for operations inside OMP target regions because the FIR declare operations within OMP target do not offer sufficient information for alias analysis. Consequently, it is necessary to examine the FIR code outside the OMP target region.
Configuration menu - View commit details
-
Copy full SHA for 73ad416 - Browse repository at this point
Copy the full SHA 73ad416View commit details -
[clang][analyzer] PointerSubChecker should not warn on pointers conve…
…rted to numerical type (llvm#111846) Pointer values casted to integer (non-pointer) type should be able to be subtracted as usual.
Configuration menu - View commit details
-
Copy full SHA for f74f568 - Browse repository at this point
Copy the full SHA f74f568View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a65e98 - Browse repository at this point
Copy the full SHA 6a65e98View commit details -
[LV] Account for any-of reduction when computing costs of blend phis.
Any-of reductions are narrowed to i1. Update the legacy cost model to use the correct type when computing the cost of a phi that gets lowered to selects (BLEND). This fixes a divergence between legacy and VPlan-based cost models after 36fc291. Fixes llvm#111874.
Configuration menu - View commit details
-
Copy full SHA for 65da32c - Browse repository at this point
Copy the full SHA 65da32cView commit details -
Configuration menu - View commit details
-
Copy full SHA for d941254 - Browse repository at this point
Copy the full SHA d941254View commit details -
[XRay] Fix LLVM include in xray_interface.cpp (llvm#111978)
Removes a dependency on LLVM in `xray_interface.cpp` by replacing `llvm_unreachable` with compiler-rt's `UNREACHABLE`. Applies clang-format to some unformatted changes. Original PR: llvm#90959
Configuration menu - View commit details
-
Copy full SHA for 4451f9f - Browse repository at this point
Copy the full SHA 4451f9fView commit details -
[Flang][OpenMP]Add tests for TODOs and small changes to improve messa…
…ges (llvm#111562) The bulk of this change are new tests to check that we get a "Not yet implemneted: *some stuff here*" message when using some not yet supported OpenMP functionality. For some of these cases, this also means adding additional clauses to a filter list in OpenMP.cpp - this changes nothing [to the best of my understanding] other than allowing the clause to get to the point where it can be rejected in a TODO with a more clear message. One of the TOOD filters were missing Mergeable clause, so this was also added and the existing test updated for the new more specific error message. There is no functional change intended here.
Configuration menu - View commit details
-
Copy full SHA for 0163ac1 - Browse repository at this point
Copy the full SHA 0163ac1View commit details -
[RISCV][MC] Fix >32bit .insn Directives (llvm#111878)
The original patch had a reasonably significant bug. You could not use `.insn` to assemble encodings that had any bits set above the low 32 bits. This is due to the fact that `getMachineOpValue` was truncating the immediate value, and I did not commit enough tests of useful cases. This changes the result of `getMachineOpValue` to be able to return the 48-bit and 64-bit immediates needed for the wider `.insn` directives. I took the opportunity to move some of the test cases around in the file to make looking at the output of `llvm-objdump` a little clearer.
Configuration menu - View commit details
-
Copy full SHA for b5ea5be - Browse repository at this point
Copy the full SHA b5ea5beView commit details -
[AArch64] Add SchedReadAdvance to Neoverse-V1 scheduling model. (llvm…
…#111538) Introduce a description of late forwarding to the Neoverse-V1 Scheduling model.
Configuration menu - View commit details
-
Copy full SHA for 303c8d2 - Browse repository at this point
Copy the full SHA 303c8d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 90627a5 - Browse repository at this point
Copy the full SHA 90627a5View commit details -
CodeGen: Remove redundant REQUIRES registered-target from tests (llvm…
…#111982) These are already in target specific test directories.
Configuration menu - View commit details
-
Copy full SHA for 14705a9 - Browse repository at this point
Copy the full SHA 14705a9View commit details -
[NFC][CodingStandard] Add additional example for if-else brace rule (l…
…lvm#111733) Add example to document that single statement `else` needs a brace if the associated `if` needs a brace.
Configuration menu - View commit details
-
Copy full SHA for 900ea21 - Browse repository at this point
Copy the full SHA 900ea21View commit details -
[NFC] Rename
Intrinsic::getDeclaration
togetOrInsertDeclaration
(l……lvm#111752) Rename the function to reflect its correct behavior and to be consistent with `Module::getOrInsertFunction`. This is also in preparation of adding a new `Intrinsic::getDeclaration` that will have behavior similar to `Module::getFunction` (i.e, just lookup, no creation).
Configuration menu - View commit details
-
Copy full SHA for fa789df - Browse repository at this point
Copy the full SHA fa789dfView commit details -
Configuration menu - View commit details
-
Copy full SHA for c84f759 - Browse repository at this point
Copy the full SHA c84f759View commit details -
[RISCV] Add DAG combine to turn (sub (shl X, 8-Y), (shr X, Y)) into o…
…rc.b (llvm#111828) This patch generalizes the DAG combine for `(sub (shl X, 8), X) => (orc.b X)` into the more general form of `(sub (shl X, 8 - Y), (srl X, Y)) => (orc.b X)`. Alive2 generalized proof: https://alive2.llvm.org/ce/z/dFcf_n Related issue: llvm#96595 Related PR: llvm#96680
Configuration menu - View commit details
-
Copy full SHA for 26b832a - Browse repository at this point
Copy the full SHA 26b832aView commit details -
Revert "[NVPTX] Prefer prmt.b32 over bfi.b32 (llvm#110766)"
This reverts commit 3f9998a. It breaks downstream tests with egregious numerical differences. Unfortunately no upstream tests are broken, but the fact that a prior iteration of the commit (pre-optimization) does work with our downstream tests (coming from the Triton repo) supports the claim that the final version of the commit is incorrect. Reverting now so that the original author can evaluate.
Configuration menu - View commit details
-
Copy full SHA for 9a696b6 - Browse repository at this point
Copy the full SHA 9a696b6View commit details -
[Clang] Replace Intrinsic::getDeclaration with getOrInsertDeclaration (…
…llvm#111990) Fix build failure from the rename change. Looks like one additional reference sneaked in between pre-commit checks and the commit itself.
Configuration menu - View commit details
-
Copy full SHA for c8da225 - Browse repository at this point
Copy the full SHA c8da225View commit details -
Revert "[clang] Implement TTP P0522 pack matching for deduced functio…
…n template calls. (llvm#111457)" See discussion in llvm#111711 This reverts commit 4dadf42.
Configuration menu - View commit details
-
Copy full SHA for ed7251b - Browse repository at this point
Copy the full SHA ed7251bView commit details -
Revert "[clang] CWG2398: improve overload resolution backwards compat (…
…llvm#107350)" See discussion in llvm#111711 This reverts commit 224519b.
Configuration menu - View commit details
-
Copy full SHA for bdd46cc - Browse repository at this point
Copy the full SHA bdd46ccView commit details -
Revert "Reland: [clang] Finish implementation of P0522 (llvm#111711)"
See discussion in llvm#111711 This reverts commit 6213aa5.
Configuration menu - View commit details
-
Copy full SHA for efcfa6e - Browse repository at this point
Copy the full SHA efcfa6eView commit details -
[AMDGPU] Rewrite RegSeqNames using !foreach. NFC. (llvm#111994)
This reduces the total number of TableGen records produced by AMDGPU.td by about 6%.
Configuration menu - View commit details
-
Copy full SHA for 870d37d - Browse repository at this point
Copy the full SHA 870d37dView commit details -
Turn
-Wdeprecated-literal-operator
on by default (llvm#111027)It would be nice to see what our users think about this change, as this is something that WG21/EWG quite wants to fix a handful of questionable issues with UB. Depending on the outcome of this after being committed, we might instead suggest EWG undeprecate this, and require a bit of 'magic' from the lexer. Additionally, this patch makes it so we emit this diagnostic ALSO in cases where the literal name is reserved. It doesn't make sense to limit that. --------- Co-authored-by: Vlad Serebrennikov <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c8554e1 - Browse repository at this point
Copy the full SHA c8554e1View commit details -
[lldb] Fix finding make tool for tests (llvm#111980)
Fixes 0e91323 / llvm#111531 For reasons I can't explain, a clean build works fine for me, and all the bots are working fine. But if I rebuild in some way the make tool becomes None. Looking at the other variables, they had these extra lines so I've added those for make and it seems to solve the problem.
Configuration menu - View commit details
-
Copy full SHA for a2bd5db - Browse repository at this point
Copy the full SHA a2bd5dbView commit details -
[AArch64] Disable consecutive store merging when Neon is unavailable (l…
…lvm#111519) Lowering fixed-size BUILD_VECTORS without Neon may introduce stack spills, leading to more stores/reloads than if the stores were not merged. In some cases, it can also prevent using paired store instructions. In the future, we may want to relax when SVE is available, but currently, the SVE lowerings for BUILD_VECTOR are limited to a few specific cases.
Configuration menu - View commit details
-
Copy full SHA for c3a10dc - Browse repository at this point
Copy the full SHA c3a10dcView commit details -
[clang] Ignore inline namespace for
hasName
(llvm#109147)Add a new enumeration `SuppressInlineNamespaceMode` to `PrintingPolicy` that is explicit about how to handle inline namespaces. `SuppressInlineNamespace` uses that enumeration now instead of a Boolean value. Specializing a template from an inline namespace should be transparent. For instance ``` namespace foo { inline namespace v1 { template<typename A> void function(A&); } } namespace foo { template<> void function<int>(int&); } ``` `hasName` should match both declarations of `foo::function`. Makes the behavior of `matchesNodeFullSlow` and `matchesNodeFullFast` consistent, fixing an assert inside `HasNameMatcher::matchesNode`.
Configuration menu - View commit details
-
Copy full SHA for bd12729 - Browse repository at this point
Copy the full SHA bd12729View commit details -
[libc++abi] Remove unused LIBCXXABI_LIBCXX_INCLUDES CMake option (llv…
…m#111824) This hasn't been used for several years, so it's effectively dead code at this point.
Configuration menu - View commit details
-
Copy full SHA for 923fef9 - Browse repository at this point
Copy the full SHA 923fef9View commit details -
[libc++] Add output groups to run-buildbot (llvm#111739)
This improves the CI output by providing collapsable sections for sub-parts of our build. This was originally opened as llvm#75233. Co-authored-by: eric <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 33c14f1 - Browse repository at this point
Copy the full SHA 33c14f1View commit details -
[mlir][vector] Add more tests for ConvertVectorToLLVM (7/n) (llvm#111895
Configuration menu - View commit details
-
Copy full SHA for f7eb271 - Browse repository at this point
Copy the full SHA f7eb271View commit details -
[RISCV] Introduce VLOptimizer pass (llvm#108640)
The purpose of this optimization is to make the VL argument, for instructions that have a VL argument, as small as possible. This is implemented by visiting each instruction in reverse order and checking that if it has a VL argument, whether the VL can be reduced. By putting this pass before VSETVLI insertion, we see three kinds of changes to generated code: 1. Eliminate VSETVLI instructions 2. Reduce the VL toggle on VSETVLI instructions that also change vtype 3. Reduce the VL set by a VSETVLI instruction The list of supported instructions is currently whitelisted for safety. In the future, we could add more instructions to `isSupportedInstr` to support even more VL optimization. We originally wrote this pass because vector GEP instructions do not take a VL, which leads us to emit code that uses VL=VLMAX to implement GEP in the RISC-V backend. As a result, some of the vector instructions will write to lanes, specifically between the intended VL and VLMAX, that will never be read. As an alternative to this pass, we considered adding a vector predicated GEP instruction, but this would not fit well into the intrinsic type system since GEP has a variable number of arguments, each with arbitrary types. The second approach we considered was to put this pass after VSETVLI insertion, but we found that it was more difficult to recognize optimization opportunities, especially across basic block boundaries -- the data flow analysis was also a bit more expensive and complex. While this pass solves the GEP problem, we have expanded it to handle more cases of VL optimization, and there is opportunity for the analysis to be improved to enable even more optimization. We have a few follow up patches to post, but figured this would be a good start. --------- Co-authored-by: Craig Topper <[email protected]> Co-authored-by: Kito Cheng <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 1c94388 - Browse repository at this point
Copy the full SHA 1c94388View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5dac691 - Browse repository at this point
Copy the full SHA 5dac691View commit details -
[mlir] [dataflow] unify semantics of program point (llvm#110344)
The concept of a 'program point' in the original data flow framework is ambiguous. It can refer to either an operation or a block itself. This representation has different interpretations in forward and backward data-flow analysis. In forward data-flow analysis, the program point of an operation represents the state after the operation, while in backward data flow analysis, it represents the state before the operation. When using forward or backward data-flow analysis, it is crucial to carefully handle this distinction to ensure correctness. This patch refactors the definition of program point, unifying the interpretation of program points in both forward and backward data-flow analysis. How to integrate this patch? For dense forward data-flow analysis and other analysis (except dense backward data-flow analysis), the program point corresponding to the original operation can be obtained by `getProgramPointAfter(op)`, and the program point corresponding to the original block can be obtained by `getProgramPointBefore(block)`. For dense backward data-flow analysis, the program point corresponding to the original operation can be obtained by `getProgramPointBefore(op)`, and the program point corresponding to the original block can be obtained by `getProgramPointAfter(block)`. NOTE: If you need to get the lattice of other data-flow analyses in dense backward data-flow analysis, you should still use the dense forward data-flow approach. For example, to get the Executable state of a block in dense backward data-flow analysis and add the dependency of the current operation, you should write: ``getOrCreateFor<Executable>(getProgramPointBefore(op), getProgramPointBefore(block))`` In case above, we use getProgramPointBefore(op) because the analysis we rely on is dense backward data-flow, and we use getProgramPointBefore(block) because the lattice we query is the result of a non-dense backward data flow computation. related dsscussion: https://discourse.llvm.org/t/rfc-unify-the-semantics-of-program-points/80671/8 corresponding PSA: https://discourse.llvm.org/t/psa-program-point-semantics-change/81479
Configuration menu - View commit details
-
Copy full SHA for 4b3f251 - Browse repository at this point
Copy the full SHA 4b3f251View commit details -
[IR] LangRef: state explicitly that floats generally behave according…
… to IEEE-754 (llvm#102140) Fixes llvm#60942: IEEE semantics is likely what many frontends want (it definitely is what Rust wants), and it is what LLVM passes already assume when they use APFloat to propagate float operations. This does not reflect what happens on x87, but what happens there is just plain unsound (llvm#89885, llvm#44218); there is no coherent specification that will describe this behavior correctly -- the backend in combination with standard LLVM passes is just fundamentally buggy in a hard-to-fix-way. There's also the questions around flushing subnormals to zero, but [this discussion](https://discourse.llvm.org/t/questions-about-llvm-canonicalize/79378) seems to indicate a general stance of: this is specific non-standard hardware behavior, and generally needs LLVM to be told that basic float ops do not return the standard result. Just naively running LLVM-compiled code on hardware configured to flush subnormals will lead to llvm#89885-like issues. AFAIK this is also what Alive2 implements (@nunoplopes please correct me if I am wrong).
Configuration menu - View commit details
-
Copy full SHA for a8a6624 - Browse repository at this point
Copy the full SHA a8a6624View commit details -
[TTI][AMDGPU] Allow targets to adjust
LastCallToStaticBonus
via `ge……tInliningLastCallToStaticBonus` (llvm#111311) Currently we will not be able to inline a large function even if it only has one live use because the inline cost is still very high after applying `LastCallToStaticBonus`, which is a constant. This could significantly impact the performance because CSR spill is very expensive. This PR adds a new function `getInliningLastCallToStaticBonus` to TTI to allow targets to customize this value. Fixes SWDEV-471398.
Configuration menu - View commit details
-
Copy full SHA for e34e27f - Browse repository at this point
Copy the full SHA e34e27fView commit details -
[OpenACC] Fix 'classof' for two unused types.
While implementing a different clause, I discovered these placeholder clauses had their 'classof' implementation done incorrectly, so this fixes that.
Configuration menu - View commit details
-
Copy full SHA for 2d133aa - Browse repository at this point
Copy the full SHA 2d133aaView commit details -
[OpenACC] Make all AST enums have an underlying type.
We store these in a few places, so ensuring they are kept in a uint8_t will minimize the amount of storage on the stack.
Configuration menu - View commit details
-
Copy full SHA for bae17a2 - Browse repository at this point
Copy the full SHA bae17a2View commit details -
[mlir][linalg] raise generic to named ops. (llvm#110421)
Add support for specializing linalg.broadcast and linalg.transform from generic. Also, does some refactoring to reuse specialization checks, migrating some common uses to op interface methods.
Configuration menu - View commit details
-
Copy full SHA for c13f806 - Browse repository at this point
Copy the full SHA c13f806View commit details -
[RISCV] Fix -Wunused-function in RISCVVLOptimizer.cpp (NFC)
/llvm-project/llvm/lib/Target/RISCV/RISCVVLOptimizer.cpp:125:21: error: unused function 'operator<<' [-Werror,-Wunused-function] static raw_ostream &operator<<(raw_ostream &OS, const OperandInfo &OI) { ^ 1 error generated.
Configuration menu - View commit details
-
Copy full SHA for 8b17916 - Browse repository at this point
Copy the full SHA 8b17916View commit details -
[SandboxVec][DAG] Extend DAG (llvm#111908)
This patch implements growing the DAG towards the top or bottom. This does the necessary dependency checks and adds new mem dependencies.
Configuration menu - View commit details
-
Copy full SHA for e8dd95e - Browse repository at this point
Copy the full SHA e8dd95eView commit details -
[AMDGPU] Avoid resource propagation for recursion through multiple fu…
…nctions (llvm#111004) Avoid constructing recursive MCExpr definitions when multiple functions cause a recursion. Fixes llvm#110863
Configuration menu - View commit details
-
Copy full SHA for 67160c5 - Browse repository at this point
Copy the full SHA 67160c5View commit details -
[AMDGPU][SIPreEmitPeephole] mustRetainExeczBranch: use BranchProbabil…
…ity and TargetSchedmodel (llvm#109818) Remove s_cbranch_execnz branches if the transformation is profitable according to `BranchProbability` and `TargetSchedmodel`.
Configuration menu - View commit details
-
Copy full SHA for 2d5f3b0 - Browse repository at this point
Copy the full SHA 2d5f3b0View commit details -
[ELF] Make shouldAddProvideSym return values consistent when demoted …
…to Undefined Case: `PROVIDE(f1 = bar);` when both `f1` and `bar` are in separate sections that would be discarded by GC. Due to `demoteDefined`, `shouldAddProvideSym(f1)` may initially return false (when Defined) and then return true (been demoted to Undefined). ``` addScriptReferencedSymbolsToSymTable shouldAddProvideSym(f1): false // the RHS (bar) is not added to `referencedSymbols` and may be GCed declareSymbols shouldAddProvideSym(f1): false markLive demoteSymbolsAndComputeIsPreemptible // demoted f1 to Undefined processSymbolAssignments addSymbol shouldAddProvideSym(f1): true ``` The inconsistency can cause `cmd->expression()` in `addSymbol` to be evaluated, leading to `symbol not found: bar` errors (since `bar` in the RHS is not in `referencedSymbols` and is GCed) (llvm#111478). Fix this by adding a `sym->isUsedInRegularObj` condition, making `shouldAddProvideSym(f1)` values consistent. In addition, we need a `sym->exportDynamic` condition to keep provide-shared.s working. Fixes: ebb326a Pull Request: llvm#111945
Configuration menu - View commit details
-
Copy full SHA for 1c6688a - Browse repository at this point
Copy the full SHA 1c6688aView commit details -
Configuration menu - View commit details
-
Copy full SHA for b174689 - Browse repository at this point
Copy the full SHA b174689View commit details -
[llvm-profdata] Default to MemProf version 3 (llvm#108863)
It's very confusing to have support for Verion 3 but not default to it. This patch teaches llvm-profdata to use MemProf version 3 by default.
Configuration menu - View commit details
-
Copy full SHA for 75774c1 - Browse repository at this point
Copy the full SHA 75774c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for eef6c09 - Browse repository at this point
Copy the full SHA eef6c09View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a53dc6 - Browse repository at this point
Copy the full SHA 8a53dc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2acec3e - Browse repository at this point
Copy the full SHA 2acec3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f47627 - Browse repository at this point
Copy the full SHA 0f47627View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f156ef - Browse repository at this point
Copy the full SHA 3f156efView commit details -
[OpenACC] Implement loop 'gang' clause. (llvm#112006)
The 'gang' clause is used to specify parallel execution of loops, thus has some complicated rules depending on the 'loop's associated compute construct. This patch implements all of those.
Configuration menu - View commit details
-
Copy full SHA for 5b25c31 - Browse repository at this point
Copy the full SHA 5b25c31View commit details -
Support inline diagnostics in CommandReturnObject (llvm#110901)
and implement them for dwim-print (a.k.a. `p`) as an example. The next step will be to expose them as structured data in SBCommandReturnObject.
Configuration menu - View commit details
-
Copy full SHA for 089227f - Browse repository at this point
Copy the full SHA 089227fView commit details -
[mlir] Fix allocateAndCopyWithAlign for immutable (llvm#108679)
Previously this would assert when attempting to getMutableData.
Configuration menu - View commit details
-
Copy full SHA for b96ebee - Browse repository at this point
Copy the full SHA b96ebeeView commit details -
Revert "[AMDGPU] Avoid resource propagation for recursion through mul…
…tiple functions" (llvm#112013) Reverts llvm#111004
Configuration menu - View commit details
-
Copy full SHA for 50866e8 - Browse repository at this point
Copy the full SHA 50866e8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c01b27 - Browse repository at this point
Copy the full SHA 2c01b27View commit details -
Configuration menu - View commit details
-
Copy full SHA for 04af63b - Browse repository at this point
Copy the full SHA 04af63bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 188ede2 - Browse repository at this point
Copy the full SHA 188ede2View commit details -
[X86] Add test coverage for llvm#110317
Add tests showing potential to use PSHUFB for shifts of constant uniform values by using a pre-computed LUT of all legal shift amounts
Configuration menu - View commit details
-
Copy full SHA for 03447ab - Browse repository at this point
Copy the full SHA 03447abView commit details -
[mlir][spirv] Make gen_spirv_dialect.py more precise when finding ins…
…t_category (llvm#111777) Use a word boundary, current code was currently failing when parsing the definition of because it would also match `CooperativeMatrixOp` from a later mention of `SPIRV_KHR_CooperativeMatrixOperandsAttr`.
Configuration menu - View commit details
-
Copy full SHA for 64bcb27 - Browse repository at this point
Copy the full SHA 64bcb27View commit details -
[flang][OpenMP] Parsing support for map type modifiers (llvm#111860)
This commit adds parsing of type modifiers for the MAP clause: CLOSE, OMPX_HOLD, and PRESENT. The support for ALWAYS has already existed. The new modifiers are not yet handled in lowering: when present, a TODO message is emitted and compilation stops.
Configuration menu - View commit details
-
Copy full SHA for 697d65d - Browse repository at this point
Copy the full SHA 697d65dView commit details -
[ORC] Try to skip unsupported targets in reoptimization test.
Many build bots are getting failures because of this: https://lab.llvm.org/buildbot/#/builders/140/builds/8600 https://lab.llvm.org/buildbot/#/builders/137/builds/6824 https://lab.llvm.org/buildbot/#/builders/140/builds/8600
Configuration menu - View commit details
-
Copy full SHA for 222d8fa - Browse repository at this point
Copy the full SHA 222d8faView commit details -
[ORC] Try to skip unsupported targets in another reoptimization test.
Skipping another reoptimization test when target is not found.
Configuration menu - View commit details
-
Copy full SHA for 7fea5c0 - Browse repository at this point
Copy the full SHA 7fea5c0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 756ef97 - Browse repository at this point
Copy the full SHA 756ef97View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4ef737a - Browse repository at this point
Copy the full SHA 4ef737aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9162754 - Browse repository at this point
Copy the full SHA 9162754View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7f1b465 - Browse repository at this point
Copy the full SHA 7f1b465View commit details -
[SandboxIR] Add a test for creating non-contiguous Regions. (llvm#112027
Configuration menu - View commit details
-
Copy full SHA for 2aa1dbf - Browse repository at this point
Copy the full SHA 2aa1dbfView commit details -
[Clang] [Sema] Don't crash on unexpanded pack in invalid block literal (
llvm#110762) Consider llvm#109148: ```c++ template <typename ...Ts> void f() { [] { (^Ts); }; } ``` When we encounter `^Ts`, we try to parse a block and subsequently call `DiagnoseUnexpandedParameterPack()` (in `ActOnBlockArguments()`), which sees `Ts` and sets `ContainsUnexpandedParameterPack` to `true` in the `LambdaScopeInfo` of the enclosing lambda. However, the entire block is subsequently discarded entirely because it isn’t even syntactically well-formed. As a result, `ContainsUnexpandedParameterPack` is `true` despite the lambda’s body no longer containing any unexpanded packs, which causes an assertion the next time `DiagnoseUnexpandedParameterPack()` is called. This pr moves handling of unexpanded parameter packs into `CapturingScopeInfo` instead so that the same logic is used for both blocks and lambdas. This fixes this issue since the `ContainsUnexpandedParameterPack` flag is now part of the block (and before that, its `CapturingScopeInfo`) and no longer affects the surrounding lambda directly when the block is parsed. Moreover, this change makes blocks actually usable with pack expansion. This fixes llvm#109148.
Configuration menu - View commit details
-
Copy full SHA for 48bda00 - Browse repository at this point
Copy the full SHA 48bda00View commit details -
Reapply "[Clang][Sema] Refactor collection of multi-level template ar…
…gument lists (llvm#106585, llvm#111173)" (llvm#111852) This patch reapplies llvm#111173, fixing a bug when instantiating dependent expressions that name a member template that is later explicitly specialized for a class specialization that is implicitly instantiated. The bug is addressed by adding the `hasMemberSpecialization` function, which return `true` if _any_ redeclaration is a member specialization. This is then used when determining the instantiation pattern for a specialization of a template, and when collecting template arguments for a specialization of a template.
Configuration menu - View commit details
-
Copy full SHA for 2bb3d3a - Browse repository at this point
Copy the full SHA 2bb3d3aView commit details -
Configuration menu - View commit details
-
Copy full SHA for f0909e3 - Browse repository at this point
Copy the full SHA f0909e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for bc53359 - Browse repository at this point
Copy the full SHA bc53359View commit details -
[SandboxVec][Interval] Implement Interval::comesBefore() (llvm#112026)
This patch implements `Interval::comesBefore(const Interval &Other)` which returns true if this interval is strictly before Other in program order. The function asserts that the intervals are disjoint.
Configuration menu - View commit details
-
Copy full SHA for 31b85c6 - Browse repository at this point
Copy the full SHA 31b85c6View commit details -
[MachineVerifier] Report errors from one thread at a time (llvm#111605)
Create the `ReportedErrors` class to track the number of reported errors during verification. The class will block reporting errors if some other thread is currently reporting an error. I've encountered a case where there were many different verifications reporting errors at the same time on different threads. This ensures that we don't start printing the error from one case until we are completely done printing errors from other cases. Most of the time `AbortOnError = true` so we usually abort after reporting the first error. Depends on llvm#111602.
Configuration menu - View commit details
-
Copy full SHA for adaa603 - Browse repository at this point
Copy the full SHA adaa603View commit details -
[mlir][OpenMP] Implement the ConvertToLLVMPatternInterface (llvm#101997)
This patch implements the `ConvertToLLVMPatternInterface` for the OpenMP dialect, allowing `convert-to-llvm` to act on the OpenMP dialect.
Configuration menu - View commit details
-
Copy full SHA for 58d9703 - Browse repository at this point
Copy the full SHA 58d9703View commit details -
[SandboxVec][Interval][NFC] Rename From/To to Top/Bottom (llvm#112034)
The API was already using top()/bottom() but internally we were still using From/To. This patch fixes this. Top/Bottom seems a better choice because implies program order, whereas From/To does not.
Configuration menu - View commit details
-
Copy full SHA for cc8edbc - Browse repository at this point
Copy the full SHA cc8edbcView commit details -
[RISCV] Enable store clustering by default (llvm#73796)
Builds on llvm#73789, enabling store clustering by default using the same heuristic.
Configuration menu - View commit details
-
Copy full SHA for 2967e5f - Browse repository at this point
Copy the full SHA 2967e5fView commit details -
[lld][Hexagon] Support predicated-add GOT_16_X mask lookup (llvm#111896)
When encountering an instruction like `if (p0) r0 = add(r0,##bar@GOT)`, lld would fail with: ``` ld.lld: error: unrecognized instruction for 16_X type: 0x7400C000 ``` This issue was encountered while building libreadline with clang 19.1.0. Fixes: llvm#111876
Configuration menu - View commit details
-
Copy full SHA for 77aa825 - Browse repository at this point
Copy the full SHA 77aa825View commit details -
[AMDGPU][test] update test with update_mc_test_check (llvm#111913)
a non-functional change Update test script with update_mc_test_check script and sort the testline to be alphabetic order. This helps to maintain the test file in a clean state
Configuration menu - View commit details
-
Copy full SHA for 16c8056 - Browse repository at this point
Copy the full SHA 16c8056View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ed8acf - Browse repository at this point
Copy the full SHA 3ed8acfView commit details -
Configuration menu - View commit details
-
Copy full SHA for b95bf05 - Browse repository at this point
Copy the full SHA b95bf05View commit details
Commits on Oct 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 00aafdc - Browse repository at this point
Copy the full SHA 00aafdcView commit details