Add Louvain community detection algorithm by Becheler · Pull Request #453 · boostorg/graph

Becheler · 2026-02-02T13:01:23Z

Implement #447

Multi-level modularity optimization following Blondel et al. (2008)
Supports custom quality functions (other than modularity) with policy-based design (extensions to come to propose alternative quality functions to match gen-louvain)
Incremental quality tracking
Lazy rollback
Vertex shuffling
Competitive with established implementation, see benchmarks here

Review-driven changes implemented in this PR:

Review items still open:

Make internal containers fully user-configurable via templating the map type (avoid hard-coding std::map/std::set; consider flat_map default)

I reviewed allocating container in the file (4 maps/sets and ~13 vectors). They all live in louvain_detail internals or are local scratch inside the public function, they do not appear in the API signature.

The maps operate on types created by the internal aggregate() (agg_vertex_t, community_type) that the user never sees. The set only allocates in the slow/non-incremental fallback path. The final remap is O(k) and one-shot and exists only to give meaningful contiguous labels. The vectors back iterator_property_map, which requires random-access iterators.

All maps/sets already use boost::unordered_flat_map/unordered_flat_set so the flat_map suggestion is effectively already done.

Templating the container type would add a template parameter that leaks internal types into the public API for no user-observable benefit : the containers are transient scratch rebuilt each pass, not long-lived state.

In short I would rather keep the API simple for now and avoid making too much noise (the genericity level looks already fairly complex to me), but I would be happy to revisit if a concrete use case comes up. I would rather focus on a next PR about more direct benefits:

Supporting undirected CSR graphs
Supporting directed graphs (requires quality function tweaks)
Supporting generic termination criterion as different application domains may require different stopping conditions, including fixed gain thresholds (Campigotto et al., 2014), threshold scaling (Halappanavar et al., 2017), decisions learned from gain decay patterns, or number of vertices moved, or a mix of those.

jeremy-murphy · 2026-02-04T22:53:34Z

Not sure why this error is occurring only for OSX 11.7 C++14. I assume this is not specific to your code.

In file included from ../../../boost/container/detail/operator_new_helpers.hpp:26:
../../../boost/container/detail/aligned_allocation.hpp:87:26: error: no member named 'aligned_alloc' in the global namespace; did you mean 'aligned_allocate'?
   return rounded_size ? ::aligned_alloc(al, rounded_size) : 0;
                         ^~~~~~~~~~~~~~~
                         aligned_allocate
../../../boost/container/detail/aligned_allocation.hpp:77:14: note: 'aligned_allocate' declared here
inline void* aligned_allocate(std::size_t al, std::size_t sz)
             ^
1 error generated.

jeremy-murphy

Just a few comments, more later.

include/boost/graph/louvain_clustering.hpp

jeremy-murphy · 2026-02-04T22:59:53Z

include/boost/graph/louvain_clustering.hpp

+    std::map<VertexDescriptor, WeightType> internal_weights;
+    std::map<VertexDescriptor, std::set<VertexDescriptor>> vertex_mapping;


We generally try to avoid hard-coding the choice of data structure, especially std::map and std::set, so instead of templating VertexDescriptor and WeightType we should template the whole map type, so users can use boost::unordered_map or some other kind of property map of their own choice.

Mhhh I see. I felt it was not in the BGL spirit to do so, and Joaquin also mentioned it, but I was not sure how to solve it without making the API heavy. Can we default to a concrete type to simplify user's experience ? Also, is it ok to use boost::unordered_map if it adds the constraint of key_type being hashable ? That was my idea behind using std::map

Oh yes, we can still (and should) make the user experience nice with defaults. That's the great thing, we get both benefits, the cost is more work on the part of the library authors. :)
Again, I think astar is an example, but instead of using param = arg in the function definition, make an overload, like so:

auto user_friendly_foo(graph const &g) { ConcreteA a; ConcreteB b; return generic_foo(g, a, b); }

Sorry, typing on my phone, so please ignore random syntax errors.

Umm, yeah, we probably shouldn't add new constraints into the default interface. Users will too easily assume that the constraint is mandatory.
So maybe use boost::flat_map as the default and users can always use a hash map if they want to.

jeremy-murphy · 2026-02-04T23:05:44Z

include/boost/graph/louvain_clustering.hpp

+    std::set<community_type> unique_communities;
+    std::map<community_type, vertex_descriptor> comm_to_vertex;
+    std::map<vertex_descriptor, std::set<vertex_descriptor>> vertex_to_originals;


These should almost certainly be input parameters taken by non-const reference so that the user a) decides their type and b) automatically gets their value at the end.

So they gain access to the whole hierarchy. Ufff it's a lot of guts leaking out haha
Will do, and tell you in case of problemsm thanks again for your time !

I could be wrong. But have a look at the astar API for some examples of prior art.

And on second thought, this is not a priority, we can always do it later. Getting it correct and fast are higher priorities.

this was not supposed to be commited :)

include/boost/graph/louvain_clustering.hpp

include/boost/graph/louvain_quality_functions.hpp

include/boost/graph/louvain_clustering.hpp

joaquintides · 2026-02-07T10:34:17Z

include/boost/graph/louvain_quality_functions.hpp

+// L_c = internal edge weight for community c
+// k_c = sum of degrees in community c
+// m = total edge weight / 2
+struct newman_and_girvan


Is this modularity thing a general concept or does it apply to Louvain only?

This is a yes and no situation.

The modularity can be used outside of louvain to assess partition quality of a graph.

But the current implementation with incremental computations (remove, insert, gain) is particularly suited for Louvain.

Making it generally useful would require to disentangle the two aspects.

But I would rather do it in a clustering folder so we can have:

include/boost/graph/clustering/ ├── quality_functions.hpp # 10 incremental metrics/criterions for gen-louvain ├── label_propagation.hpp # another clustering method (future work) ├── leiden.hpp # Leiden algorithm (future work) ├── louvain.hpp # Louvain algorithm ├── girvan_newman.hpp # Edge betweenness clustering (currently in bc_clustering.hpp)

jeremy-murphy

Couple more requests, still going...

include/boost/graph/louvain_clustering.hpp

…, type mismatch in for loop

…ation paths

doc/louvain_clustering.html

joaquintides · 2026-02-20T10:41:30Z

doc/louvain_clustering.html

+
+<H3>Parameters</H3>
+
+IN: <tt>const Graph&amp; g</tt>


The algorithm has the additional requirement that vertices are copyable, hashable etc., as they're internally stored in unordered_sets.

You're right. I have been changing the vertices handling in this aspect because it was not friendly with some types of graphs. The interface now takes a VertexIndexMap but I still have to commit those changes, sorry 😓
I will update the documentation in that sense once I merged the new stuff

Is this still an open issue or resolved?

include/boost/graph/louvain_clustering.hpp

jeremy-murphy · 2026-02-24T11:24:47Z

Btw, are the benchmarks in the description current? I mean, did the performance change at all since you swapped some standard map containers for Boost ones?
I'm curious to know if it made a difference but in the description we just want whatever is the latest benchmark.

Becheler · 2026-02-24T11:40:53Z

@jeremy-murphy

I have this benchmark repository here: https://github.com/Becheler/boost-graph-benchmarks/tree/main/louvain
they are run locally then results pushed, but we chatted with Joaquin about making it a CI thing
switching hashmaps did not improve performance, but I have been switching to indexing anyway because it was more generic for some types of graphs
I haven't done a full benchmark since the last changes (its a few hours of computation lol) so the figures are somewhat outdated
The seemingly really fast performance gain between genlouvain and igraph comes from a different tradeoff between termination criterion and partition quality:
- I compared bgl 0 threshold against genlouvain 10-6 threshold.
- I aligned their behavior along the same total number of passes
- the decreasing curves answer "what percentage of total gain is still unrealized after n passes"
- the little bars answer "how much this pass contributed"
- and you can see the 5 levels of aggregation as the algorithm tries to find the best partition into communities
- what is interesting is that local optimization in the first level eats most of the passes for very little gain.
- min_inner_threshold affect the number of passes spent inside each level
- min_outer_threshold affect the total number of levels explored
- And so the graph shows that (for this graph case) min_inner_threshold is the main driver of the number of iterations:

jeremy-murphy · 2026-02-28T16:10:12Z

Btw, you can use - [ ] and - [x] in the markup to make a todo list and tick things off as they're done. It will make it easier for us as reviewers to know where you are up to. E.g.:

done
not done

jeremy-murphy · 2026-03-11T23:00:27Z

include/boost/graph/louvain_clustering.hpp

+#ifndef BOOST_GRAPH_LOUVAIN_TRUST_AGGREGATED_Q
+#define BOOST_GRAPH_LOUVAIN_TRUST_AGGREGATED_Q 0
+#endif
+
+#ifndef BOOST_GRAPH_LOUVAIN_TRACK_PEAK_Q
+#define BOOST_GRAPH_LOUVAIN_TRACK_PEAK_Q 0
+#endif


Ahh, I only just discovered these macros, sorry, and I'm just wondering if they are just meant for private experimental purposes or if users would want to use them too?

If they're for private experiments, I think I would prefer they were kept on a private branch. I mean, I think we should only merge into develop the code that is useful to everyone.

If they are useful to everyone, then they need to be documented and I don't think macros are really OK.

I think once this is resolved, then we're good to merge. :)

Becheler added 3 commits February 2, 2026 13:48

Add Louvain clustering algorithm

374db7b

adding louvain tests to jamfile

6ef3267

add some comments

76efd88

jeremy-murphy self-assigned this Feb 4, 2026

jeremy-murphy added the enhancement label Feb 4, 2026

jeremy-murphy reviewed Feb 4, 2026

View reviewed changes

Becheler added 2 commits February 5, 2026 00:45

Delete scratch/benchmark/run_benchmark.sh

de9b6a8

this was not supposed to be commited :)

Delete scratch/benchmark/bgl_louvain.cpp

385e8c8

this was not supposed to be commited :)

joaquintides reviewed Feb 6, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Show resolved Hide resolved

joaquintides reviewed Feb 6, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Show resolved Hide resolved

joaquintides reviewed Feb 6, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

joaquintides reviewed Feb 6, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

joaquintides reviewed Feb 7, 2026

View reviewed changes

include/boost/graph/louvain_quality_functions.hpp Show resolved Hide resolved

joaquintides reviewed Feb 7, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

joaquintides reviewed Feb 7, 2026

View reviewed changes

jeremy-murphy requested changes Feb 8, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

include/boost/graph/louvain_clustering.hpp Show resolved Hide resolved

Becheler added 11 commits February 9, 2026 16:03

PR review: fixed copyright, local optimization visibility, assertions…

78d9225

…, type mismatch in for loop

fix: URGB made generic

422d376

adding LouvainQualityFunctionConcept

6b278b8

incremental versus non-incremental concepts

28721e1

fix wrong namespace

c5c9ac4

fix unused variables in concepts

24002db

incremental and non incremental metrics can lead to different optimiz…

a02bc0f

…ation paths

Trigger CI

0034d3f

incremental and non incremental metrics can lead to different optimiz…

e8760cf

…ation paths

fix: no hierarchy_t, free unfold function

7180cd6

docs

fb61051

joaquintides reviewed Feb 20, 2026

View reviewed changes

doc/louvain_clustering.html Show resolved Hide resolved

joaquintides reviewed Feb 20, 2026

View reviewed changes

doc/louvain_clustering.html Show resolved Hide resolved

joaquintides reviewed Feb 20, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

joaquintides reviewed Feb 20, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

joaquintides reviewed Feb 20, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Outdated Show resolved Hide resolved

jeremy-murphy requested changes Feb 22, 2026

View reviewed changes

include/boost/graph/louvain_clustering.hpp Show resolved Hide resolved

include/boost/graph/louvain_clustering.hpp Show resolved Hide resolved

Becheler added 4 commits February 23, 2026 15:50

index-based interals and contguous outputs labels

af23880

index-based interals and contguous outputs labels

967f47b

fix dosctrsing

bdedea7

specializing std::hash forbidden here

4b8a584

Becheler added 4 commits February 24, 2026 16:50

quality functions passed as objects with named methods

f04a9cc

default value for policy

39c4863

typo

a4e2ada

updated documentation

ca528f5

Becheler added 4 commits March 6, 2026 13:35

fix: assertion on edge added

85f0174

fix: cleanup dead internal_weights

a846cde

test: TRUST_AGGREGATED_Q path

ef7933e

fix: disentangle trust Q and track peak

79523b6

jeremy-murphy reviewed Mar 11, 2026

View reviewed changes

		std::map<VertexDescriptor, WeightType> internal_weights;
		std::map<VertexDescriptor, std::set<VertexDescriptor>> vertex_mapping;


		<H3>Parameters</H3>

		IN: <tt>const Graph& g</tt>

Conversation

Becheler commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review-driven changes implemented in this PR:

Review items still open:

Uh oh!

jeremy-murphy commented Feb 4, 2026

Uh oh!

jeremy-murphy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jeremy-murphy Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Becheler Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jeremy-murphy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jeremy-murphy commented Feb 24, 2026

Uh oh!

Becheler commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jeremy-murphy commented Feb 28, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Becheler commented Feb 2, 2026 •

edited

Loading

jeremy-murphy Feb 8, 2026 •

edited

Loading

Becheler Feb 9, 2026 •

edited

Loading

Becheler commented Feb 24, 2026 •

edited

Loading