# Request to Include TTM-R1 and TTM-R2 Models in Comparison #244
Replies: 2 comments 2 replies
-
Hey, thank you for the suggestion. We don't update the benchmark in this repo often but you might want to check the FEV leaderboard that we just released (cc @shchur). Currently, that includes Benchmark II from the paper and we might consider adding other models such as TTM into the mix. That said, a direct comparison with TTM may be a little bit unfair (for TTM) since it seems to have been mainly developed for relatively high frequencies (hourly and higher). You might want to check out the GIFT-Eval leaderboard which includes TTM and several other models including Chronos, Chronos-Bolt, Moirai and TimesFM. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the response and for pointing me to the GIFT-Eval leaderboard! I spent some time looking through it, and it’s clear that Chronos-Bolt is an absolute beast, especially in minutely and hourly CRPS/MASE metrics. I can see why it’s such a strong choice for high-frequency time series tasks. That got me thinking - how would TTM (especially TTM-R2) stack up in this setup? Since TTM is specifically designed for high-frequency data, I feel like it could offer some interesting trade-offs compared to Chronos-Bolt, especially given how lightweight it is. GIFT-Eval seems like a perfect framework to explore this, so I wanted to ask: is there any chance you might include TTM in the leaderboard at some point? I think it would add a lot of value to see how it performs in these benchmarks. If it’s not something you’re planning to add, would you have any recommendations for how I could test it myself? Specifically: What dataset(s) would you recommend for a fair comparison? Should I stick with something from the GIFT-Eval benchmarks, or is there a better option for this kind of evaluation? Looking forward to hearing your thoughts! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Request to Include TTM-R1 and TTM-R2 Models in Comparison
Context
Сurrently evaluate various time series forecasting models as shown in the comparison chart. These models include local, task-specific, and pretrained options. However, the evaluation does not yet include the TinyTimeMixers (TTM-R1 and TTM-R2) models, which are lightweight and pretrained models developed by IBM Research for multivariate time series forecasting. Including these models in the comparison would provide a more comprehensive analysis and benchmark their performance against the current state-of-the-art.
Why Include TTM-R1 and TTM-R2?
Proposed Actions
Resources
Expected Outcome
Inclusion of TTM models in the comparison framework will:
Beta Was this translation helpful? Give feedback.
All reactions