performance: avoid O(n^2) in PDEJavaHelper #747

jukzi · 2023-09-18T13:02:29Z

findPackageFragmentRoot() searches through all PackageFragment Roots and is called for every libPaths. This can be slow due to involved file access.
see eclipse-jdt/eclipse.jdt.core#303

Instead call getAllPackageFragmentRoots() only once, index the result and use O(1) hash access.

github-actions · 2023-09-18T13:27:10Z

Test Results

  261 files -     12   261 suites - 12 43m 41s ⏱️ - 21m 30s
3 341 tests +      1 3 308 ✔️ -       1 30 💤 ±  0 3 ❌ +3
4 895 runs - 5 422 4 850 ✔️ - 5 376 42 💤 - 48 3 ❌ +3

For more details on these failures, see this check.

Results for commit 42c0a49. ± Comparison against base commit f5c8301.

♻️ This comment has been updated with latest results.

HannesWell

Isn't the caching better done in the IJavaProject implementation?
Then all callers benefit from it immediatly and the cache has to be build only once. And the impl probably also knows best when to invalidate the cache.

ui/org.eclipse.pde.core/src/org/eclipse/pde/internal/core/util/PDEJavaHelper.java

jukzi · 2023-09-19T06:44:57Z

Isn't the caching better done in the IJavaProject implementation?

i don't know who this could be implemented as the result currently depends on files/directory on filesystem, which could have changed.

Then all callers benefit from it immediatly and the cache has to be build only once. And the impl probably also knows best when to invalidate the cache.

together with eclipse-jdt/eclipse.jdt.ui#796 i just adapted all known repeated callers.

jukzi · 2023-09-19T13:43:25Z

@vogella please undo "merge" and use rebase instead

HannesWell · 2023-09-19T23:46:18Z

Isn't the caching better done in the IJavaProject implementation?

i don't know who this could be implemented as the result currently depends on files/directory on filesystem, which could have changed.

If it is querying the file-system over and over again it would be an even better reason to cache it for all since we both know that this is slow.

Is it querying the files directly through standard java API or through Eclipse IResources? Because in case of the later a IResourceChangeListener could be registered to get notified about deltas in the file-system.

jukzi · 2023-09-20T06:02:14Z

Is it querying the files directly through standard java API or through Eclipse IResources?

no - see stacktrace in eclipse-jdt/eclipse.jdt.core#303 - java.io

HannesWell · 2023-09-20T08:18:26Z

Is it querying the files directly through standard java API or through Eclipse IResources?

no - see stacktrace in eclipse-jdt/eclipse.jdt.core#303 - java.io

Too bad.
In generell one could use native Filesystem-Hooks, but that's probably a bit too complicated for this purpose.

HannesWell · 2023-09-20T20:47:46Z

ui/org.eclipse.pde.core/src/org/eclipse/pde/internal/core/util/PDEJavaHelper.java

+					if (classRootPath != null) {
+						rootsByPath.put(classRootPath, classpathRoot);
+					}
+				}
 				ListIterator<IPath> li = libPaths.listIterator();
 				while (li.hasNext()) {


Since you already touched this, could you please convert this while loop and if you want the outer for loop over all projects into an enhanced for loop?
Btw. I wonder why there is no quick-fix that suggest to convert this iterator+while into an enhanced for-loop.

there is a cleanup, that could do this. but it currently fails. let's wait till that is fixed: eclipse-jdt/eclipse.jdt.ui#798

HannesWell · 2023-09-20T21:09:44Z

In general wonder if the number of libraries is that often greater than one.
For a simple project base.getPluginBase().getLibraries() returns an empty array and thus libPaths only contains the project's path as single element.

For In that case it is probably faster to keep the current behavior since caching will then probably be more expensive because it collects all roots in any case, even if the first one would already match. Plus the memory overhead.

A all pleasing solution would probably be something like:

Function<IPath, IPackageFragmentRoot> findRootWithPath;                                 
if (libPaths.size() == 1) {                                                     
	findRootWithPath = p -> {                                                           
		try {                                                                   
			return jp.findPackageFragmentRoot(p);                               
		} catch (JavaModelException e) {                                        
			return null;                                                        
		}                                                                       
	};                                                                          
} else {                                                                        
	Map<IPath, IPackageFragmentRoot> rootsByPath = new HashMap<>();             
	for (IPackageFragmentRoot classpathRoot : jp.getAllPackageFragmentRoots()) {
		IPath classRootPath = classpathRoot.getPath();                          
		if (classRootPath != null) {                                            
			rootsByPath.put(classRootPath, classpathRoot);                      
		}                                                                       
	}                                                                           
	findRootWithPath = rootsByPath::get;                                                
}                                                                               
Optional<IPackageFragment> fragment = libPaths.stream().map(findRootWithPath).filter(Objects::nonNull)
	.map(root -> root.getPackageFragment(packageName)).filter(IPackageFragment::exists).findFirst();
if (fragment.isPresent()) {
	return fragment.get();
}

Although it is now much longer than before.

jukzi · 2023-09-25T08:23:41Z

Although it is now much longer than before.

Thats not an even a performance improvement. findPackageFragmentRoot internally calls getAllPackageFragmentRoots anyway.

findPackageFragmentRoot() searches through all PackageFragment Roots and is called for every libPaths. This can be slow due to involved file access. see eclipse-jdt/eclipse.jdt.core#303 Instead call getAllPackageFragmentRoots() only once, index the result and use O(1) hash access.

HannesWell · 2023-09-25T22:47:12Z

Although it is now much longer than before.

Thats not an even a performance improvement. findPackageFragmentRoot internally calls getAllPackageFragmentRoots anyway.

OK, then just disregard my remark and AFAICT this looks fine to me.

Maybe it would be worth to consider in JDT to make findPackageFragmentRoot() a short-circuit operation?

jukzi · 2023-09-26T07:20:07Z

Maybe it would be worth to consider in JDT to make findPackageFragmentRoot() a short-circuit operation?

PRs welcome. i can review.

HannesWell reviewed Sep 18, 2023

View reviewed changes

ui/org.eclipse.pde.core/src/org/eclipse/pde/internal/core/util/PDEJavaHelper.java Outdated Show resolved Hide resolved

jukzi force-pushed the getAllPackageFragmentRoots branch from 1bd180a to ce63951 Compare September 20, 2023 06:12

HannesWell reviewed Sep 20, 2023

View reviewed changes

jukzi force-pushed the getAllPackageFragmentRoots branch from ce63951 to 5256249 Compare September 25, 2023 08:52

HannesWell force-pushed the getAllPackageFragmentRoots branch from 5256249 to 42c0a49 Compare September 25, 2023 22:45

jukzi merged commit a6601a5 into eclipse-pde:master Sep 26, 2023
11 of 14 checks passed

jukzi deleted the getAllPackageFragmentRoots branch September 26, 2023 07:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance: avoid O(n^2) in PDEJavaHelper #747

performance: avoid O(n^2) in PDEJavaHelper #747

jukzi commented Sep 18, 2023

github-actions bot commented Sep 18, 2023 •

edited

Loading

HannesWell left a comment

jukzi commented Sep 19, 2023

jukzi commented Sep 19, 2023

HannesWell commented Sep 19, 2023

jukzi commented Sep 20, 2023

HannesWell commented Sep 20, 2023

HannesWell Sep 20, 2023

jukzi Sep 25, 2023

HannesWell commented Sep 20, 2023

jukzi commented Sep 25, 2023

HannesWell commented Sep 25, 2023

jukzi commented Sep 26, 2023

performance: avoid O(n^2) in PDEJavaHelper #747

performance: avoid O(n^2) in PDEJavaHelper #747

Conversation

jukzi commented Sep 18, 2023

github-actions bot commented Sep 18, 2023 • edited Loading

Test Results

HannesWell left a comment

Choose a reason for hiding this comment

jukzi commented Sep 19, 2023

jukzi commented Sep 19, 2023

HannesWell commented Sep 19, 2023

jukzi commented Sep 20, 2023

HannesWell commented Sep 20, 2023

HannesWell Sep 20, 2023

Choose a reason for hiding this comment

jukzi Sep 25, 2023

Choose a reason for hiding this comment

HannesWell commented Sep 20, 2023

jukzi commented Sep 25, 2023

HannesWell commented Sep 25, 2023

jukzi commented Sep 26, 2023

github-actions bot commented Sep 18, 2023 •

edited

Loading