[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts #110324

jurahul · 2024-09-27T20:35:01Z

Check name conflicts between intrinsics caused by mangling suffix.

If the base name of an overloaded intrinsic is a proper prefix of another intrinsic, check if the other intrinsic name suffix after the proper prefix can match a mangled type and issue an error if it can.

llvmbot · 2024-09-30T17:08:39Z

@llvm/pr-subscribers-tablegen

@llvm/pr-subscribers-llvm-ir

Author: Rahul Joshi (jurahul)

Changes

Check name conflicts between intrinsics caused by mangling suffix.

If the base name of an overloaded intrinsic is a proper prefix of another intrinsic, check if the other intrinsic name suffix after the proper prefix can match a mangled type and issue an error if it can.

Full diff: https://github.com/llvm/llvm-project/pull/110324.diff

5 Files Affected:

(added) llvm/test/TableGen/intrinsic-overload-conflict.td (+43)
(modified) llvm/unittests/IR/IntrinsicsTest.cpp (+62-3)
(modified) llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp (+127)
(modified) llvm/utils/TableGen/Basic/CodeGenIntrinsics.h (+4)
(modified) llvm/utils/TableGen/IntrinsicEmitter.cpp (+1-1)

diff --git a/llvm/test/TableGen/intrinsic-overload-conflict.td b/llvm/test/TableGen/intrinsic-overload-conflict.td
new file mode 100644
index 00000000000000..2e3c5f6f087fff
--- /dev/null
+++ b/llvm/test/TableGen/intrinsic-overload-conflict.td
@@ -0,0 +1,43 @@
+// RUN: llvm-tblgen -gen-intrinsic-enums -I %p/../../include %s -DTEST_INTRINSICS_SUPPRESS_DEFS | FileCheck %s
+// RUN: not llvm-tblgen -gen-intrinsic-enums -I %p/../../include %s -DTEST_INTRINSICS_SUPPRESS_DEFS -DCONFLICT 2>&1 | FileCheck %s -DFILE=%s --check-prefix=CHECK-CONFLICT
+
+
+include "llvm/IR/Intrinsics.td"
+// CHECK: foo = 1,
+def int_foo : Intrinsic<[llvm_any_ty]>;
+
+// No conflicts, since .bar is not a vaid mangled type.
+// CHECK: foo_bar,
+def int_foo_bar : Intrinsic<[llvm_i32_ty]>;
+
+// CHECK: foo_bar_f32,
+def int_foo_bar_f32 : Intrinsic<[llvm_i32_ty]>;
+
+#ifdef CONFLICT
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.a3` cannot share prefix `llvm.foo.a3` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.bf16` cannot share prefix `llvm.foo.bf16` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.f32` cannot share prefix `llvm.foo.f32` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.f64` cannot share prefix `llvm.foo.f64` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.f_3` cannot share prefix `llvm.foo.f_3` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.i65` cannot share prefix `llvm.foo.i65` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.i65.bar` cannot share prefix `llvm.foo.i65` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.p3` cannot share prefix `llvm.foo.p3` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.s_3` cannot share prefix `llvm.foo.s_3` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: error: intrinsic `llvm.foo.v4` cannot share prefix `llvm.foo.v4` with another overloaded intrinsic `llvm.foo`
+// CHECK-CONFLICT: 10 errors
+// CHECK-CONFLICT-NOT: error
+
+// Confcts due to suffix being a posssible mangled type
+def int_foo_f32 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_f64 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_bf16: Intrinsic<[llvm_i32_ty]>;
+def int_foo_p3 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_a3 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_v4 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_func : Intrinsic<[llvm_i32_ty], [], [], "llvm.foo.f_3">;
+def int_foo_struct : Intrinsic<[llvm_i32_ty], [], [], "llvm.foo.s_3">;
+def int_foo_t3 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_i65 : Intrinsic<[llvm_i32_ty]>;
+def int_foo_i65_bar : Intrinsic<[llvm_i32_ty]>;
+
+#endif
diff --git a/llvm/unittests/IR/IntrinsicsTest.cpp b/llvm/unittests/IR/IntrinsicsTest.cpp
index 0c4af28a2ab57b..1f9d17c2e0fe56 100644
--- a/llvm/unittests/IR/IntrinsicsTest.cpp
+++ b/llvm/unittests/IR/IntrinsicsTest.cpp
@@ -63,7 +63,7 @@ class IntrinsicsTest : public ::testing::Test {
 };
 
 TEST(IntrinsicNameLookup, Basic) {
-  static constexpr const char *const NameTable1[] = {
+  static constexpr const char *const NameTable[] = {
       "llvm.foo", "llvm.foo.a", "llvm.foo.b", "llvm.foo.b.a", "llvm.foo.c",
   };
 
@@ -74,15 +74,74 @@ TEST(IntrinsicNameLookup, Basic) {
   };
 
   for (const auto &[Name, ExpectedIdx] : Tests) {
-    int Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable1, Name);
+    int Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable, Name);
     EXPECT_EQ(ExpectedIdx, Idx);
     if (!StringRef(Name).starts_with("llvm.foo"))
       continue;
-    Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable1, Name, "foo");
+    Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable, Name, "foo");
     EXPECT_EQ(ExpectedIdx, Idx);
   }
 }
 
+// Test case to demonstrate potential conflicts between overloaded and non-
+// overloaded intrinsics. The name match works by essentially dividing then
+// name into . separated components and doing successive search for each
+// component. When a search fails, the lowest component of the matching
+// range for the previous component is returned.
+TEST(IntrinsicNameLookup, OverloadConflict) {
+  // Assume possible mangled type strings are just .f32 and .i32.
+  static constexpr const char *const NameTable[] = {
+      "llvm.foo",
+      "llvm.foo.f32",
+      "llvm.foo.i32",
+  };
+
+  // Here, first we match llvm.foo and our search window is [0,2]. Then we try
+  // to match llvm.foo.f64 and there is no match, so it returns the low of the
+  // last match. So this lookup works as expected.
+  int Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable, "llvm.foo.f64");
+  EXPECT_EQ(Idx, 0);
+
+  // Now imagine if llvm.foo has 2 mangling suffixes, .f32 and .f64. The search
+  // will first match llvm.foo to [0, 2] and then llvm.foo.f32 to [1,1] and then
+  // not find any match for llvm.foo.f32.f64. So it will return the low of the
+  // last match, which is llvm.foo.f32. However, the intent was to match
+  // llvm.foo. So the presence of llvm.foo.f32 eliminated the possibility of
+  // matching with llvm.foo. So it seems if we have an intrinsic llvm.foo,
+  // another one with the same suffix and a single .suffix is not going to
+  // cause problems. If there exists another one with 2 or more suffixes,
+  // .suffix0 and .suffix1, its possible that the mangling suffix for llvm.foo
+  // might match with .suffix0 and then the match will fail to match llvm.foo.
+  // .suffix1 won't be a problem because its the last one so the matcher will
+  // try an exact match (in which case exact name in the table was searched for,
+  // so its expected to match that entry).
+  //
+  // This example leads the the following rule: if we have an overloaded
+  // intrinsic with name `llvm.foo` and another one with same prefix and one or
+  // more suffixes, `llvm.foo[.<suffixN>]+`, then the name search will try to
+  // first match against suffix0, then suffix1 etc. If suffix0 can match a
+  // mangled type, then the search for an `llvm.foo` with a mangling suffix can
+  // match against suffix0, preventing a match with `llvm.foo`. If suffix0
+  // cannot match a mangled type, then that cannot happen, so we do not need to
+  // check for later suffixes. Generalizing, the `llvm.foo[.suffixN]+` will
+  // cause a conflict if the first suffix (.suffix0) can match a mangled type
+  // (and then we do not need to check later suffixes) and will not cause a
+  // conflict if it cannot (and then again, we do not need to check for later
+  // suffixes.)
+  Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable, "llvm.foo.f32.f64");
+  EXPECT_EQ(Idx, 1);
+
+  // Here .bar and .f33 do not conflict with the manging suffixes, so the search
+  // should match against llvm.foo.
+  static constexpr const char *const NameTable1[] = {
+      "llvm.foo",
+      "llvm.foo.bar",
+      "llvm.foo.f33",
+  };
+  Idx = Intrinsic::lookupLLVMIntrinsicByName(NameTable1, "llvm.foo.f32.f64");
+  EXPECT_EQ(Idx, 0);
+}
+
 // Tests to verify getIntrinsicForClangBuiltin.
 TEST(IntrinsicNameLookup, ClangBuiltinLookup) {
   using namespace Intrinsic;
diff --git a/llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp b/llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp
index c3bd7efd8387a8..57a73f9e9e399a 100644
--- a/llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp
+++ b/llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp
@@ -76,6 +76,7 @@ CodeGenIntrinsicTable::CodeGenIntrinsicTable(const RecordKeeper &RC) {
 
   CheckDuplicateIntrinsics();
   CheckTargetIndependentIntrinsics();
+  CheckOverloadSuffixConflicts();
 }
 
 // Check for duplicate intrinsic names.
@@ -124,6 +125,132 @@ void CodeGenIntrinsicTable::CheckTargetIndependentIntrinsics() const {
   }
 }
 
+// Return true if the given Suffix looks like mangled type. Note that this
+// check is conservative, but allows all existing LLVM intrinsic suffixes to be
+// consider as not looking like a mangling suffix.
+static bool DoesSuffixLookLikeMangledType(StringRef Suffix) {
+  // Try to match against possible mangling suffixes for various types.
+  // See getMangledTypeStr() for the mangling suffixes possible. It includes
+  //  pointer       : p[0-9]+
+  //  array         : a[0-9]+[.+]
+  //  struct:       : s_/sl_[.+]
+  //  function      : f_[.+]
+  //  vector        : v/nxv[0-9]+[.+]
+  //  target type   : t[.*]
+  //  integer       : i[0-9]+
+  //  named types   : See `NamedTypes` below.
+
+  // Match anything with an _, so match function and struct types.
+  if (Suffix.contains('_'))
+    return true;
+
+  // [a|v][0-9|$][.*] // $ is end of string.
+  if (is_contained("av", Suffix[0]) &&
+      (Suffix.size() == 1 || isDigit(Suffix[1])))
+    return true;
+
+  // nxv[0-9|$][.*]
+  if (Suffix.starts_with("nxv") && (Suffix.size() == 3 || isDigit(Suffix[3])))
+    return true;
+  // t[.*]
+  if (Suffix.starts_with('t'))
+    return false;
+
+  // [p|i][0-9]+
+  if ((Suffix[0] == 'i' || Suffix[0] == 'p') &&
+      all_of(Suffix.drop_front(), isDigit))
+    return true;
+
+  // Match one of the named types.
+  StringLiteral NamedTypes[] = {"isVoid",  "Metadata", "f16",  "f32",
+                                "f64",     "f80",      "f128", "bf16",
+                                "ppcf128", "x86amx"};
+  return is_contained(NamedTypes, Suffix);
+}
+
+// Check for conflicts with overloaded intrinsics. If there exists an overloaded
+// intrinsic with base name `llvm.target.foo`, LLVM will add a mangling suffix
+// to it to encode the overload types. This mangling suffix is 1 or more .
+// prefixed mangled type string as defined in `getMangledTypeStr`. If there
+// exists another intrinsic `llvm.target.foo[.<suffixN>]+`, which has the same
+// prefix as the overloaded intrinsic, its possible that there may be a name
+// conflict with the overloaded intrinsic and either one may interfere with name
+// lookup for the other, leading to wrong intrinsic ID being assigned.
+//
+// The actual name lookup in the intrinsic name table is done by a search
+// on each successive . separted component of the intrinsic name (see
+// `lookupLLVMIntrinsicByName`). Consider first the case where there exists a
+// non-overloaded intrinsic `llvm.target.foo[.suffix]+`. For the non-overloaded
+// intrinsics, the name lookup is an exact match, so the presence of the
+// overloaded intrinsic with the same prefix will not interfere with the
+// search. However, a lookup intended to match the overloaded intrinsic might be
+// affected by the presence of another entry in the name table with the same
+// prefix. See the `OverloadConflict` sub-test in IntrinsicsTest.cpp to
+// demonstrate the cases where there is a conflict and for the exact check
+// (replicated below) for when the conflict can or cannot happen.
+//
+// Since LLVM's name lookup first selects the target specific (or target
+// independent) slice of the name table to look into, intrinsics in 2 different
+// slices cannot conflict with each other. Within a specific slice,
+// if we have an overloaded intrinsic with name `llvm.target.foo` and another
+// one with same prefix and one or more suffixes `llvm.target.foo[.<suffixN>]+`,
+// then the name search will try to first match against suffix0, then suffix1
+// etc. If suffix0 can match a mangled type, then the search for an
+// `llvm.target.foo` with a mangling suffix can match against suffix0,
+// preventing a match with `llvm.target.foo`. If suffix0 cannot match a mangled
+// type, then that cannot happen, so we do not need to check for later suffixes.
+//
+// Generalizing, the `llvm.target.foo[.suffixN]+` will cause a conflict if the
+// first suffix (.suffix0) can match a mangled type (and then we do not need to
+// check later suffixes) and will not cause a conflict if it cannot (and then
+// again, we do not need to check for later suffixes.)
+void CodeGenIntrinsicTable::CheckOverloadSuffixConflicts() const {
+  for (const TargetSet &Set : Targets) {
+    const CodeGenIntrinsic *Overloaded = nullptr;
+    for (const CodeGenIntrinsic &Int : (*this)[Set]) {
+      // If we do not have an overloaded intrinsic to check against, nothing
+      // to do except potentially identifying this as a candidate for checking
+      // against in future iteration.
+      if (!Overloaded) {
+        if (Int.isOverloaded)
+          Overloaded = &Int;
+        continue;
+      }
+
+      StringRef Name = Int.Name;
+      StringRef OverloadName = Overloaded->Name;
+      // If we have an overloaded intrinsic to check again, check if its name is
+      // a proper prefix of this intrinsic.
+      if (Name.starts_with(OverloadName) && Name[OverloadName.size()] == '.') {
+        // If yes, verify suffixes and flag an error.
+        StringRef Suffixes = Name.drop_front(OverloadName.size() + 1);
+
+        // Only need to look at the first suffix.
+        StringRef Suffix0 = Suffixes.split('.').first;
+
+        if (!DoesSuffixLookLikeMangledType(Suffix0))
+          continue;
+
+        unsigned SuffixSize = OverloadName.size() + 1 + Suffix0.size();
+        // If suffix looks like mangling suffix, flag it as an error.
+        PrintError(Int.TheDef->getLoc(),
+                   "intrinsic `" + Name + "` cannot share prefix `" +
+                       Name.take_front(SuffixSize) +
+                       "` with another overloaded intrinsic `" + OverloadName +
+                       "`");
+        PrintNote(Overloaded->TheDef->getLoc(),
+                  "Overloaded intrinsic `" + OverloadName + "` defined here");
+        continue;
+      }
+
+      // If we find an intrinsic that is not a proper prefix, any later
+      // intrinsic is also not going to be a proper prefix, so invalidate the
+      // overloaded to check against.
+      Overloaded = nullptr;
+    }
+  }
+}
+
 CodeGenIntrinsic &CodeGenIntrinsicMap::operator[](const Record *Record) {
   if (!Record->isSubClassOf("Intrinsic"))
     PrintFatalError("Intrinsic defs should be subclass of 'Intrinsic' class");
diff --git a/llvm/utils/TableGen/Basic/CodeGenIntrinsics.h b/llvm/utils/TableGen/Basic/CodeGenIntrinsics.h
index 1cdeaacd52dcdb..cf6375c77c133a 100644
--- a/llvm/utils/TableGen/Basic/CodeGenIntrinsics.h
+++ b/llvm/utils/TableGen/Basic/CodeGenIntrinsics.h
@@ -186,6 +186,9 @@ class CodeGenIntrinsicTable {
   auto begin() const { return Intrinsics.begin(); }
   auto end() const { return Intrinsics.end(); }
   CodeGenIntrinsic &operator[](size_t Pos) { return Intrinsics[Pos]; }
+  ArrayRef<CodeGenIntrinsic> operator[](const TargetSet &Set) const {
+    return ArrayRef(&Intrinsics[Set.Offset], Set.Count);
+  }
   const CodeGenIntrinsic &operator[](size_t Pos) const {
     return Intrinsics[Pos];
   }
@@ -193,6 +196,7 @@ class CodeGenIntrinsicTable {
 private:
   void CheckDuplicateIntrinsics() const;
   void CheckTargetIndependentIntrinsics() const;
+  void CheckOverloadSuffixConflicts() const;
 };
 
 // This class builds `CodeGenIntrinsic` on demand for a given Def.
diff --git a/llvm/utils/TableGen/IntrinsicEmitter.cpp b/llvm/utils/TableGen/IntrinsicEmitter.cpp
index efa067e60de439..d5e5cf1a3b5685 100644
--- a/llvm/utils/TableGen/IntrinsicEmitter.cpp
+++ b/llvm/utils/TableGen/IntrinsicEmitter.cpp
@@ -154,7 +154,7 @@ void IntrinsicEmitter::EmitEnumInfo(const CodeGenIntrinsicTable &Ints,
 
   OS << "// Enum values for intrinsics.\n";
   bool First = true;
-  for (const auto &Int : ArrayRef(&Ints[Set->Offset], Set->Count)) {
+  for (const auto &Int : Ints[*Set]) {
     OS << "    " << Int.EnumName;
 
     // Assign a value to the first intrinsic in this target set so that all

llvm/test/TableGen/intrinsic-overload-conflict.td

llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp

nikic

TBH, if we can't forbid this generically due to existing intrinsics, I'm not sure it's really worthwhile to do this kind of heuristic matching. This looks like a decent bit of complexity to solve a theoretical problem.

I think I'd prefer doing nothing over doing this. Another alternative would be to grandfather in the handful of existing intrinsics that wouldn't pass a stricter prefix check.

jurahul · 2024-10-01T14:10:01Z

TBH, if we can't forbid this generically due to existing intrinsics, I'm not sure it's really worthwhile to do this kind of heuristic matching. This looks like a decent bit of complexity to solve a theoretical problem.

I think I'd prefer doing nothing over doing this. Another alternative would be to grandfather in the handful of existing intrinsics that wouldn't pass a stricter prefix check.

I agree that this is a theoretical problem, in the sense, the origin is not that someone ran into this. Though it's possible folks may have run into this and figured out the issue and fixed it in their intrinsic definitions, so this would avoid potential debug cycles in such cases. Having a stricter check and then grandfathering in a few intrinsics we have also sounds reasonable. Let me see how many intrinsics we are talking about (IIRC, > a dozen). If we just have a whitelist of intrinsics for which we skip this check, this will work upstream, I wonder what happens for downstream backends. I guess they will have to maintain a diff in their downstream code for additional intrinsics that should skip this check. If we really wanted to make this generic, we could add a new property to Intrinsics.td to specify that this check can be skipped (something like NoNameConflict) and then add it to all the whitelisted ones, so there is no list maintained in the code, but that would be even more heavy weight mechanism to handle this.

Let me prototype the whitelist approach and see how it looks.

arsenm · 2024-10-01T14:24:42Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

jurahul · 2024-10-01T14:56:12Z

Thanks @arsenm . It's not theoretical then at this point, and I suspect some other folks have likely run into this. @nikic does that make you reconsider your position with this PR, or should I still go investigate alternatives.

jurahul · 2024-10-04T22:44:25Z

@nikic and @arsenm, I am thinking if the current approach is too complex, may be a simpler check with an opt-out property on the intrinsic for intrinsics where we know it's safe is the alternative to pursue, as opposed to a hard coded list of whitelisted intrinsic names in the IntrinsicEmitter.cpp file. Along with the property that we will add, we will add a comment describing when exactly there could be a conflict. That means any new intrinsics added will get opted in into this check and folks have to actively opt-opt once they verify its safe. The property can be called [NoOverloadNameConflict] (or better name welcome) and will be added to Intrinsics.td. Please let me know if that sounds like a reasonable approach. I'll report here how many intrinsics we expect to tag with this to get them grandfathered in.

jurahul · 2024-10-04T22:53:17Z

We'd need to tag 225 intrinsics. If instead we say that the tag can be on either the overloaded intrinsic or the one that shares the prefix with the overloaded one, its 113 (overloaded) intrinsics that we'd need to tag.

arsenm · 2024-10-09T07:33:20Z

We'd need to tag 225 intrinsics. If instead we say that the tag can be on either the overloaded intrinsic or the one that shares the prefix with the overloaded one, its 113 (overloaded) intrinsics that we'd need to tag.

Adding a bit to change the naming rules is the worst possible outcome. I think we do this, or we do nothing

jurahul · 2024-10-09T15:44:57Z

We'd need to tag 225 intrinsics. If instead we say that the tag can be on either the overloaded intrinsic or the one that shares the prefix with the overloaded one, its 113 (overloaded) intrinsics that we'd need to tag.

Adding a bit to change the naming rules is the worst possible outcome. I think we do this, or we do nothing

By "this", you mean this PR as is, right? That is, you don't like the custom bit to opt-out of any checks. If so, I'd say we either commit this PR or atleast add a comment/keep the test to document the conflict rules (since I don't think they are as obvious) for folks to refer to. Let's wait for @nikic to comment as well and decide next steps. @nikic gentle ping when you get a chance, PTAL at the discussion above and advise next steps.

nikic · 2024-10-13T18:53:48Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

Could you please elaborate on this? I don't really understand what kind of issue you can hit when removing intrinsics.

jurahul · 2024-10-14T14:23:40Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

Could you please elaborate on this? I don't really understand what kind of issue you can hit when removing intrinsics.

May be @arsenm meant "renaming" and not "removing"? @arsenm can you clarify?

arsenm · 2024-10-14T14:24:10Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

Could you please elaborate on this? I don't really understand what kind of issue you can hit when removing intrinsics.

In 9d36428, I removed the int_amdgcn_flat_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd_v2bf16 intrinsics. These should never have been added. We already had int_amdgcn_global_atomic_fadd and int_amdgcn_flat_atomic_fadd overloaded on the type, so we had a collision between the overloaded int_amdgcn_global_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd with v2bf16.

I don't remember the details, but something confusing was going on. I'm not sure if the int_amdgcn_global_atomic_fadd_v2bf16 case was ever really parsed. You could create the intrinsic with the suffixed ID, but then it would be recognized as the mangled version, or something like that.

jurahul · 2024-10-14T15:32:11Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

Could you please elaborate on this? I don't really understand what kind of issue you can hit when removing intrinsics.

In 9d36428, I removed the int_amdgcn_flat_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd_v2bf16 intrinsics. These should never have been added. We already had int_amdgcn_global_atomic_fadd and int_amdgcn_flat_atomic_fadd overloaded on the type, so we had a collision between the overloaded int_amdgcn_global_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd with v2bf16.

I don't remember the details, but something confusing was going on. I'm not sure if the int_amdgcn_global_atomic_fadd_v2bf16 case was ever really parsed. You could create the intrinsic with the suffixed ID, but then it would be recognized as the mangled version, or something like that.

Thanks. Based on my understanding of the current code, in this situation, we would have always matched the int_amdgcn_flat_atomic_fadd_v2bf16 and never matched the int_amdgcn_flat_atomic_fadd overloaded intrinsic with v2bf16 type. Which means it would not be possible to fully test the overloaded intrinsics for these types.

The check I am adding would have flagged that, since .v216 is a valid mangled type suffix, and in this intrinsic, since the first suffix matches a mangled type suffix, it would have flagged that as an error.

jurahul · 2024-10-14T15:38:07Z

I did actually run into this oddity when removing some of the amdgpu atomic intrinsics

Could you please elaborate on this? I don't really understand what kind of issue you can hit when removing intrinsics.

In 9d36428, I removed the int_amdgcn_flat_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd_v2bf16 intrinsics. These should never have been added. We already had int_amdgcn_global_atomic_fadd and int_amdgcn_flat_atomic_fadd overloaded on the type, so we had a collision between the overloaded int_amdgcn_global_atomic_fadd_v2bf16 and int_amdgcn_global_atomic_fadd with v2bf16.
I don't remember the details, but something confusing was going on. I'm not sure if the int_amdgcn_global_atomic_fadd_v2bf16 case was ever really parsed. You could create the intrinsic with the suffixed ID, but then it would be recognized as the mangled version, or something like that.

Thanks. Based on my understanding of the current code, in this situation, we would have always matched the int_amdgcn_flat_atomic_fadd_v2bf16 and never matched the int_amdgcn_flat_atomic_fadd overloaded intrinsic with v2bf16 type. Which means it would not be possible to fully test the overloaded intrinsics for these types.

The check I am adding would have flagged that, since .v216 is a valid mangled type suffix, and in this intrinsic, since the first suffix matches a mangled type suffix, it would have flagged that as an error.

Just a further point of clarification, it would have been flagged when the last one of the int_amdgcn_flat_atomic_fadd or int_amdgcn_flat_atomic_fadd_v2bf16 was added in the first place.

jurahul · 2024-10-14T15:49:47Z

@nikic (me and @arsenm to an extent, based on the PR approval) are arguing that the extra complexity is worth it as it avoids hard-to-debug issues that developers may be running into and working around by trial and error.

llvm/unittests/IR/IntrinsicsTest.cpp

llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp

nikic

LGTM

llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp

llvm/unittests/IR/IntrinsicsTest.cpp

…llvm#110324) Check name conflicts between intrinsics caused by mangling suffix. If the base name of an overloaded intrinsic is a proper prefix of another intrinsic, check if the other intrinsic name suffix after the proper prefix can match a mangled type and issue an error if it can.

jurahul force-pushed the intrinsic_overload_prefix_check_take2 branch 2 times, most recently from 46345fb to b15f09d Compare September 30, 2024 14:02

jurahul marked this pull request as ready for review September 30, 2024 17:07

jurahul requested review from arsenm and nikic September 30, 2024 17:08

llvmbot added tablegen llvm:ir labels Sep 30, 2024

arsenm reviewed Sep 30, 2024

View reviewed changes

jurahul force-pushed the intrinsic_overload_prefix_check_take2 branch from b15f09d to 85a6422 Compare September 30, 2024 17:47

jurahul requested a review from arsenm September 30, 2024 17:48

jurahul mentioned this pull request Sep 30, 2024

[LLVM][TableGen] Add overloaded intrinsic name conflict checks #109314

Closed

arsenm approved these changes Oct 1, 2024

View reviewed changes

nikic reviewed Oct 1, 2024

View reviewed changes

jurahul requested a review from nikic October 2, 2024 13:08

nikic reviewed Oct 14, 2024

View reviewed changes

jurahul force-pushed the intrinsic_overload_prefix_check_take2 branch from 85a6422 to 90a7414 Compare October 14, 2024 18:41

nikic approved these changes Oct 14, 2024

View reviewed changes

llvm/utils/TableGen/Basic/CodeGenIntrinsics.cpp Outdated Show resolved Hide resolved

llvm/unittests/IR/IntrinsicsTest.cpp Outdated Show resolved Hide resolved

[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts

481c353

jurahul force-pushed the intrinsic_overload_prefix_check_take2 branch from 90a7414 to 481c353 Compare October 14, 2024 20:14

jurahul merged commit 2a0073f into llvm:main Oct 15, 2024
8 checks passed

jurahul deleted the intrinsic_overload_prefix_check_take2 branch October 15, 2024 15:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts #110324

[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts #110324

jurahul commented Sep 27, 2024 •

edited

Loading

llvmbot commented Sep 30, 2024 •

edited

Loading

nikic left a comment

jurahul commented Oct 1, 2024

arsenm commented Oct 1, 2024

jurahul commented Oct 1, 2024

jurahul commented Oct 4, 2024

jurahul commented Oct 4, 2024

arsenm commented Oct 9, 2024

jurahul commented Oct 9, 2024

nikic commented Oct 13, 2024

jurahul commented Oct 14, 2024

arsenm commented Oct 14, 2024

jurahul commented Oct 14, 2024

jurahul commented Oct 14, 2024

jurahul commented Oct 14, 2024

nikic left a comment

[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts #110324

[LLVM][TableGen] Check overloaded intrinsic mangling suffix conflicts #110324

Conversation

jurahul commented Sep 27, 2024 • edited Loading

llvmbot commented Sep 30, 2024 • edited Loading

nikic left a comment

Choose a reason for hiding this comment

jurahul commented Oct 1, 2024

arsenm commented Oct 1, 2024

jurahul commented Oct 1, 2024

jurahul commented Oct 4, 2024

jurahul commented Oct 4, 2024

arsenm commented Oct 9, 2024

jurahul commented Oct 9, 2024

nikic commented Oct 13, 2024

jurahul commented Oct 14, 2024

arsenm commented Oct 14, 2024

jurahul commented Oct 14, 2024

jurahul commented Oct 14, 2024

jurahul commented Oct 14, 2024

nikic left a comment

Choose a reason for hiding this comment

jurahul commented Sep 27, 2024 •

edited

Loading

llvmbot commented Sep 30, 2024 •

edited

Loading