Optimal layout assignation algorithm #296
Change the way new layout assignations are computed.
The function now computes an optimal assignation (with respect to partition size) that minimizes the distance to the former assignation, using flow algorithms.
This commit was written by Mendes Oulamara mendes.oulamara@pm.me
Hi @Mendes, thanks a lot for your amazing contribution!
In order to make your PR more approachable, would you agree to share some high-level information about your algorithm in a comment of this PR?
For example, what are the state-of-the-art algorithms/data structures/results/mathematical concepts that you used? I have seen a bipartite graph and "Dinic's max flow algorithm" in the comments, for example; do you use other ones? How do they interact together?
Based on this state of the art, could you also briefly say how you leveraged them to build the final algorithm? How did you split the original layout assignation problem into smaller problems? For example, LX identified 3 major components: 1. assigning partitions to zones, 2. assigning zone partitions to nodes, and 3. minimizing the amount of rebalancing between the old and new layout by detecting cycles in this bipartite graph. Could you confirm that this is how your algorithm works? Can you detail what each component does?
From an engineering perspective, being able to reflect and isolate these different concerns in the code would help us understand the code, test it, and feel confident enough to maintain it!
I only quickly reviewed your file src/util/bipartite.rs and have some suggestions to illustrate this proposition. First, it seems this file contains logic on 2 different concerns, if I understand correctly. The first is for a bipartite graph built with the `EdgeFlow` structure, which serves for the zone assignation. The second is for a bipartite graph built with the `WeightedEdge` structure. I would say that things that go together must be grouped together, so each structure should be next to the functions that use it. I would even say they should be declared in different files.

Next, focusing on one of your functions, like `optimize_matching`, as an engineer my first thought is that your functions are long! This is not a problem per se, but it is often good to question why; see for example Rule of 30 and Applications of Cyclomatic Complexity. Do not apply them strictly to Rust: I think it starts being a problem after 60/80 lines, and there are always exceptions. If we take the first ~30 lines, they are there to build your bipartite graph. Instead of putting this logic here, you could 1) define a structure for your bipartite graph outside of this function that contains the vector, `nb_left` and `nb_right`, and 2) define a `new` or `build` function that will encapsulate these 30 initialization lines. Then you could write a function for the Bellman-Ford algorithm, and maybe a function that outputs the optimal matching from the graph. Your `positive_cycle` function can be attached to your graph structure as a method too.

Your root function `optimize_matching` would then be as simple as:
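Something along these lines, where every name (`Matching`, `BipartiteGraph`, `build`, `positive_cycle`, `apply_cycle`, `extract_matching`) is illustrative and not taken from the actual PR:

```rust
// Sketch only: a root function that composes named building blocks.
fn optimize_matching(old_match: &Matching, new_match: &Matching) -> Matching {
    let mut graph = BipartiteGraph::build(old_match, new_match);
    // Repeatedly cancel positive cycles (found with Bellman-Ford)
    // to get closer to old_match without changing vertex degrees.
    while let Some(cycle) = graph.positive_cycle() {
        graph.apply_cycle(&cycle);
    }
    graph.extract_matching()
}
```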
What I have described here is an OOP approach to solve the problem of modularization and encapsulation. In my opinion, such properties help make the code more readable by forcing the author to name and split things; of course, if names are meaningless and code is arbitrarily split, this is useless. This last part is just an example and probably not the best way to do it, especially because I am not familiar with the concepts. My example is just here to convey an intent; other programming paradigms are possible and great, and like all approaches it should not be abused.
This looks extremely promising and I'm hyped to have this in Garage soon :)
I think we identified three high-level concerns, in addition to the comments I made in the code below:
error reporting (and general reporting of what is happening)
testing on small toy examples
clarity of the code
Error reporting. For now, this code uses asserts, unwraps and panics in many places. The consistency of the results is checked at several steps using assertions, which is good. However, if we want to report to the user what is going wrong, it would be better if all of these functions could return Result types, maybe with a custom Error enum for all of the different kinds of errors that can happen at any step, or just using String as an error type. Globally, this means the following transformations:

- replacing `panic!()` by `return Err(format!("my error message"))`
- replacing `assert!(x)` by `if !x { return Err(format!("my error message")) }`
- replacing `some_result.unwrap()` by `some_result?` and `some_option.unwrap()` by `some_option.ok_or_else(|| format!("my error message"))?`
We should take this opportunity to write explicit error messages that try to tell precisely what is going wrong. If it's a situation that isn't supposed to happen (a failed assert for instance), the error message should clearly say so.
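As an illustration, here is what the transformation could look like on a made-up function (the names and signature are invented for the example):

```rust
// Before: panics on unexpected input.
fn pick_node_old(candidates: &[u32], idx: Option<usize>) -> u32 {
    let i = idx.unwrap();
    assert!(i < candidates.len());
    candidates[i]
}

// After: returns a Result with an explicit error message.
fn pick_node(candidates: &[u32], idx: Option<usize>) -> Result<u32, String> {
    let i = idx.ok_or_else(|| format!("no candidate index was selected"))?;
    if i >= candidates.len() {
        return Err(format!(
            "candidate index {} out of range: this is not supposed to happen",
            i
        ));
    }
    Ok(candidates[i])
}
```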
General reporting of what is happening: In the previous version, when a layout change was calculated, a list of how many partitions were moved between nodes was printed on screen by the algorithm that does the calculation, so that the sysadmin could predict how much data will be moved on the network. Having this information is important, and is missing in the new version. Moreover, instead of printing this information on the console, we would ultimately need to have it returned in a struct of some kind, so that it can also be returned to users that are calling this algorithm over the network.
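For instance, a struct of roughly this shape (hypothetical, just to convey the idea):

```rust
// Hypothetical summary returned by the layout calculation, so that
// both the CLI and network callers can display the rebalance cost.
pub struct RebalanceSummary {
    /// For each (source node, destination node) pair, the number
    /// of partitions that will move between them.
    pub partitions_moved: Vec<(String, String, usize)>,
    /// Total number of partition replicas that change node.
    pub total_moved: usize,
}
```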
Testing on small toy examples. We need a high level of assurance that this algorithm actually does what it's expected to do. To do so, we need to run it on many real-world cluster configurations, in the various settings that may appear in real-world deployments:
small clusters in only 1 zone, or bigger ones
small clusters in only 2 zones, or bigger ones
clusters in 3 zones
clusters in 4 or more zones
We need a way to run the algorithm on such small toy examples and print the results somewhere so that we can check that the way capacities are assigned to different nodes corresponds to what we would intuitively expect (i.e. verify that the equations implemented here actually correspond to what we want). I think we won't be convinced to integrate this work until we have such toy visualizations to convince ourselves that everything is working :)
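Even a very small test like the following would help (all helper names here are hypothetical; the point is only the shape of the check):

```rust
// Toy example: 3 equal nodes in one zone, replication factor 3.
// We expect every node to carry every partition exactly once.
#[test]
fn toy_three_nodes_one_zone() {
    let nodes = vec![
        ("node1", "zone1", 100),
        ("node2", "zone1", 100),
        ("node3", "zone1", 100),
    ];
    let layout = compute_test_layout(&nodes, 3);
    for node_parts in layout.partitions_per_node() {
        assert_eq!(node_parts, layout.nb_partitions());
    }
}
```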
Clarity of the code. What the code is doing is not always very clear. Comments are present but a high-level view is sometimes missing. I like to have, at the beginning of a function, a high-level comment (that can be quite long) that describes what the function does, and the steps it takes to do so. It would also be nice to have comments for data structures and intermediate variables, that explain the meaning of the different possible values that are stored in the variables that are defined. (e.g. if we have a vector whose values are positions in another vector, it would be nice to have a comment saying so).
@ -177,0 +185,4 @@
//We compute the optimal number of partition to assign to
//every node and zone.
if let Some((part_per_nod, part_per_zone)) = self.optimal_proportions() {
A whole indentation level for everything that follows can be removed by doing:
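For example (assuming the `None` case can simply return early; adapt to the function's actual return type):

```rust
// Sketch: bind early and return, instead of nesting the whole
// function body inside the `if let`.
let (part_per_nod, part_per_zone) = match self.optimal_proportions() {
    Some(parts) => parts,
    None => return, // or an Err(...) once Result-based reporting lands
};
// ... the rest of the function, one indentation level shallower
```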
@ -213,0 +283,4 @@
self.ring_assignation_data.push(id as CompactNodeType);
} else {
panic!()
}
The if-else block can be written more simply as:
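For example (assuming `nod` is an `Option`):

```rust
self.ring_assignation_data
    .push(nod.unwrap() as CompactNodeType);
```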
You can also use `.expect("message")` to add a custom error message when `nod` is `None`.
@ -239,0 +324,4 @@
///This function compute the number of partition to assign to
///every node and zone, so that every partition is replicated
///self.replication_factor times and the capacity of a partition
///is maximized.
If we have nodes in only 2 zones but the replication factor is 3, does this do the trick of adding a "ghost zone" that takes some of the capacity?
And what if we have nodes in only 1 zone?
@ -239,0 +325,4 @@
///every node and zone, so that every partition is replicated
///self.replication_factor times and the capacity of a partition
///is maximized.
fn optimal_proportions(&mut self) -> Option<(Vec<usize>, HashMap<String, usize>)> {
This whole function is very obscure to me :(
I tried to put some comments but I'm not sure I understood everything.
Probably a long comment at the beginning with the high-level math would help.
@ -239,0 +339,4 @@
);
} else {
zone_capacity.insert(node_zone[i].clone(), node_capacity[i]);
}
More concise and idiomatic version:
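Presumably with the `HashMap` entry API, along these lines (assuming the `if` branch was adding to an existing total):

```rust
// Sketch: one call instead of contains_key + insert.
*zone_capacity
    .entry(node_zone[i].clone())
    .or_insert(0) += node_capacity[i];
```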
@ -288,0 +369,4 @@
),
)
})
.collect();
Not convinced this works: if I have one very big zone and two smaller zones, the largest one will be capped to exactly nb_partitions, but the smaller ones will have less than nb_partitions. But the only correct solution would have been to have exactly nb_partitions partitions in each zone. Am I correct?
@ -288,0 +380,4 @@
.keys()
.filter(|k| part_per_zone[*k] < nb_partitions)
.map(|k| zone_capacity[k])
.sum();
Rewrite as:
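One plausible version iterates over the entries, avoiding the second lookup (the variable name is invented here):

```rust
let remaining_capacity: u32 = zone_capacity
    .iter()
    .filter(|(k, _)| part_per_zone[*k] < nb_partitions)
    .map(|(_, cap)| *cap)
    .sum();
```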
@ -476,0 +524,4 @@
rf == cl.ring_assignation_data[rf * i..rf * (i + 1)]
.iter()
.map(|nod| node_zone[*nod as usize].clone())
.unique()
ALERT ALERT ALERT

I think this is wrong: `.unique()` only works on a sorted iterator, because it only de-duplicates consecutive items! If I have an iterator that gives me A, B, A, then `.unique().count()` will be 3, but this is clearly not how we want to count here.
@ -487,0 +538,4 @@
.filter(|x| **x == i as u8)
.count()
})
.collect::<Vec<_>>();
I think this can be made linear complexity instead of n * m by doing:
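For example, a single pass that counts occurrences (names are taken from the surrounding context, so to be adapted):

```rust
// Sketch: count partitions per node in one pass over the data.
let mut node_nb_part = vec![0usize; nb_nodes];
for nod in cl.ring_assignation_data.iter() {
    node_nb_part[*nod as usize] += 1;
}
```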
I didn't put comments everywhere, but I think I saw other places with double nested iterators (n * m complexity); are there any others that can be rewritten to be linear complexity?

Note that `a.iter().filter(|x| b.contains(x))` is such a double nested iterator with n * m complexity; depending on the context it might be possible to rewrite them as above.
@ -493,3 +543,1 @@
let mut part = PartitionAss::new();
for node_i in self.ring_assignation_data
[i * self.replication_factor..(i + 1) * self.replication_factor]
let zone_vec = node_zone.iter().unique().collect::<Vec<_>>();
ALERT ALERT ALERT

Same remark here on `.unique()` (see above). What has to be done instead:
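For example, sort before de-duplicating so that equal zones are adjacent (or collect into a `HashSet`):

```rust
let mut zone_vec = node_zone.iter().collect::<Vec<_>>();
zone_vec.sort();
zone_vec.dedup();
```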
@ -499,0 +549,4 @@
.filter(|x| node_zone[**x as usize] == **z)
.count()
})
.collect::<Vec<_>>();
This is a nested loop with n * m complexity. Fortunately, zone_vec is very small. However for cache locality it would have been better to rewrite it the other way:
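That is, something like a single pass over the partition's nodes, incrementing per-zone counters (a sketch; the exact types are assumed from the quoted lines):

```rust
let mut zone_nb_part = vec![0usize; zone_vec.len()];
for nod in part.iter() {
    let zi = zone_vec
        .iter()
        .position(|z| **z == node_zone[*nod as usize])
        .unwrap();
    zone_nb_part[zi] += 1;
}
```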
This is probably a very tiny optimization and not important at all. In fact the other version has other advantages, such as not necessitating the use of `.unwrap()`, and, depending on taste, being more readable (I don't necessarily agree).
@ -507,0 +573,4 @@
assert!(
node_capacity[idmin] * (node_nb_part[idnew] as u32 + 1)
>= node_capacity[idnew] * node_nb_part[idmin] as u32
);
I think we should avoid writing `if /* a very long multiline condition */ { /* sth */ }` if we can. Here I would give a temporary name to the value `(0..nb_nodes).filter(...).max_by(...)` and then just do `if let Some(idnew) = tempvalue { ... }` (don't just call it `tempvalue` though, that's an example).
@ -522,0 +587,4 @@
if let Some(idnew) = node_of_z_iter.min_by(|i, j| {
(node_capacity[*i] * (node_nb_part[*j] as u32 + 1))
.cmp(&(node_capacity[*j] * (node_nb_part[*i] as u32 + 1)))
}) {
Same here, the value matched in the `if` condition should probably have its own `let` binding above.
@ -0,0 +2,4 @@
* This module deals with graph algorithm in complete bipartite
* graphs. It is used in layout.rs to build the partition to node
* assignation.
* */
For a documentation comment that concerns the whole module, use the following syntax:
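That is, `//!` inner doc comments at the top of the file:

```rust
//! This module deals with graph algorithms in complete bipartite
//! graphs. It is used in layout.rs to build the partition to node
//! assignation.
```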
@ -0,0 +14,4 @@
c: i32,
flow: i32,
v: usize,
rev: usize,
What are the meanings of the different fields? Add this as comments.
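For example (the meanings below are my guesses from a typical max-flow implementation, to be corrected if wrong):

```rust
struct EdgeFlow {
    c: i32,     // capacity of the edge
    flow: i32,  // flow currently pushed along the edge
    v: usize,   // index of the vertex this edge points to
    rev: usize, // index of the reverse edge in v's adjacency list
}
```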
@ -0,0 +22,4 @@
struct WeightedEdge {
w: i32,
u: usize,
v: usize,
Same
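Again a guessed example, to be corrected if wrong:

```rust
struct WeightedEdge {
    w: i32,   // weight (cost) of the edge
    u: usize, // index of the source vertex
    v: usize, // index of the destination vertex
}
```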
@ -0,0 +29,4 @@
* complete bipartite graph. It returns a matching that has the
* same degree as new_match at every vertex, and that is as close
* as possible to old_match.
* */
For a doc comment for the item that follows the comment, use the following syntax:
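That is, `///` outer doc comments placed just before the item:

```rust
/// Computes a new matching in a complete bipartite graph. It returns
/// a matching that has the same degree as new_match at every vertex,
/// and that is as close as possible to old_match.
fn optimize_matching(/* ... */) {
    /* ... */
}
```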
When using `cargo doc` to generate crate documentation, these comments will then appear as documentation text. (Other comments might also be concerned by this remark.)
@ -0,0 +45,4 @@
for j in 0..nb_right {
edge_vec[i * nb_right + j].u = i;
edge_vec[i * nb_right + j].v = nb_left + j;
}
I think this whole loop would be clearer as:
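Presumably something like iterating with `chunks_mut` and `enumerate` (a sketch):

```rust
// No manual i * nb_right + j index arithmetic.
for (i, row) in edge_vec.chunks_mut(nb_right).enumerate() {
    for (j, edge) in row.iter_mut().enumerate() {
        edge.u = i;
        edge.v = nb_left + j;
    }
}
```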
(Each time we can avoid calculating array indices as `a * x + b` is a good thing.)

@ -0,0 +185,4 @@
graph[1][i].c = *c as i32;
graph[1][i].v = i + 2 + left_cap_vec.len();
graph[1][i].rev = 0;
}
This might be a stretch, but instead of using integers that have special meanings (0 = source, 1 = sink, etc) as vertex identifiers, why not use an enum as the following?
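Presumably something like this (a guessed shape):

```rust
// Vertex identifiers with explicit meaning instead of magic
// integers (0 = source, 1 = sink, ...).
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum Vertex {
    Source,
    Sink,
    Left(usize),  // i-th vertex on the left side
    Right(usize), // i-th vertex on the right side
}
```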
Values of this kind behave exactly the same as integers thanks to the derive statement, so the flow algorithm's body doesn't need to be changed. It's just the end of the algorithm that needs to change, when we reconstruct the association table from the flow graph, where we need to match on the two sides of each edge, but that's probably quite simple to do.
Using such a strategy would reduce by A LOT the risk of doing something wrong with vertex identifiers, and would also make the code a lot more elegant.
This could also be done for the bipartite cycle optimization algorithm, with a vertex type as follows:
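Again a guessed shape:

```rust
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum MatchingVertex {
    Left(usize),
    Right(usize),
}
```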
The edge types could be parametric on the vertex type:
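For instance:

```rust
// Sketch: the edge structs take the vertex type as a parameter.
struct WeightedEdge<V> {
    w: i32,
    u: V,
    v: V,
}
```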
@ -0,0 +306,4 @@
}
//otherwise, we send flow to nbd.
lifo.push_back((graph[id][nbd].v, new_flow));
}
I'll trust you that Dinic's flow algorithm is correct here ;)
The pull request was renamed from "WIP: Optimal layout assignation algorithm" to "Optimal layout assignation algorithm".

Anything that needs to be finalized on this feature before release of v0.9 will now be done on branch `next`.