`spilled()` is very slow

I am the author of the [markov_str](https://github.com/Brogolem35/markov_str) library and I trying out the `SmallVec` to speed up the creation of my Markov Chains. But I have encountered this issue while benchmarking with [cargo flamegraph](https://github.com/flamegraph-rs/flamegraph): equvalance checks between `SmallVec`s take too long and almost all the time is spent on `spilled()`.

Example:

MarkovChain type:
```rust
pub struct MarkovChain {
	items: HashMap<SmallVec<[Spur; 4]>, ChainItem>,
	state_size: usize,
	regex: Regex,
	cache: Rodeo,
}
```

benchmarked function:
```rust
	/// Adds text as training data. The tokens will be created with the regex of the MarkovChain.
	pub fn add_text(&mut self, text: &str) {
		let tokens: Vec<Spur> = self
			.regex
			.find_iter(text)
			.map(|t| self.cache.get_or_intern(t.as_str()))
			.collect();

		// vec.windows(0) panics for some reason.
		if tokens.is_empty() {
			return;
		}

		// Creating a preallocated buffer and filling and cleaning it instead of creating a new one every loop is way more efficient.
		let mut prevbuf: SmallVec<[Spur; 4]> = SmallVec::with_capacity(self.state_size);
		for win in tokens.windows(tokens.len().min(self.state_size + 1)) {
			let rel = win.last().unwrap();

			for i in 1..win.len() {
				prevbuf.clear();
				for t in win.iter().rev().skip(1).take(i).rev() {
					prevbuf.push(*t);
				}

				match self.items.raw_entry_mut().from_key(&prevbuf) {
					RawEntryMut::Occupied(mut view) => {
						view.get_mut().add(*rel);
					}
					RawEntryMut::Vacant(view) => {
						view.insert(prevbuf.clone(), ChainItem::new(*rel));
					}
				}
			}
```

Crates used:
```toml
hashbrown = "0.15.0"
lasso = {version = "0.7.3", features = ["ahasher", "inline-more"]}
rand = "0.8.5"
regex = "1.11.0"
smallvec = "1.13.2"
```

Flamegraph output:
![image](https://github.com/user-attachments/assets/8a7181c0-e9e6-4e6a-99a8-839b90b5b95a)


The code and sample data I used for this benchmark can be found at [Brogolem35/markov_str/](https://github.com/Brogolem35/markov_str/commit/93220d3ff6463fecebbb2f96d92b4f74d51837a6).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`spilled()` is very slow #361

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

spilled() is very slow #361

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`spilled()` is very slow #361