XLSTMTime: Long-Term Time Series Forecasting with xLSTM
The paper introduces xLSTMTime, an xLSTM-based architecture for long-term time series forecasting. It addresses the challenges of transformer-based models and shows superior performance on real-world datasets. Authors: Musleh Alharthi and Ausif Mahmood.
The paper titled "xLSTMTime: Long-term Time Series Forecasting With xLSTM" introduces an extended LSTM (xLSTM) architecture for multivariate long-term time series forecasting (LTSF). The model addresses challenges faced by transformer-based models, such as high computational demands and difficulty in capturing temporal dynamics. The xLSTM architecture incorporates exponential gating and a revised memory structure with higher capacity, showing promising potential for LTSF tasks. The study compares xLSTMTime's performance with state-of-the-art models across various real-world datasets, demonstrating superior forecasting capabilities. The findings suggest that refined recurrent architectures like xLSTMTime could provide competitive alternatives to transformer-based models in LTSF, potentially reshaping the landscape of time series forecasting. The paper is authored by Musleh Alharthi and Ausif Mahmood and is available via its DOI.
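The exponential gating mentioned above is easiest to see in the sLSTM update equations from the xLSTM paper. Below is a minimal NumPy sketch of a single sLSTM-style step, assuming per-gate input weights W_*, recurrent weights R_*, and biases b_*; the function name, interface, and shapes are illustrative assumptions, not the authors' reference implementation.

```python
# Minimal sketch of an sLSTM-style cell with exponential gating,
# following the update equations from the xLSTM paper.
# Names and the single-step interface are illustrative, not official code.
import numpy as np

def slstm_step(x, h_prev, c_prev, n_prev, m_prev, params):
    """One recurrent step; n is a normalizer state, m a stabilizer state."""
    W_z, W_i, W_f, W_o, R_z, R_i, R_f, R_o, b_z, b_i, b_f, b_o = params

    z_tilde = W_z @ x + R_z @ h_prev + b_z   # cell-input pre-activation
    i_tilde = W_i @ x + R_i @ h_prev + b_i   # input-gate pre-activation
    f_tilde = W_f @ x + R_f @ h_prev + b_f   # forget-gate pre-activation
    o_tilde = W_o @ x + R_o @ h_prev + b_o   # output-gate pre-activation

    # Exponential gating, stabilized in log space to avoid overflow
    m = np.maximum(f_tilde + m_prev, i_tilde)   # stabilizer state
    i_gate = np.exp(i_tilde - m)                # stabilized exponential input gate
    f_gate = np.exp(f_tilde + m_prev - m)       # stabilized exponential forget gate
    o_gate = 1.0 / (1.0 + np.exp(-o_tilde))     # sigmoid output gate

    z = np.tanh(z_tilde)
    c = f_gate * c_prev + i_gate * z            # cell state
    n = f_gate * n_prev + i_gate                # normalizer state
    h = o_gate * (c / n)                        # normalized hidden state
    return h, c, n, m
```

The normalizer and stabilizer states keep the exponential gates numerically well behaved; this covers only the gating idea, not the full xLSTMTime forecasting pipeline.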
Related
xLSTM Explained in Detail
Maximilian Beck's YouTube video delves into xLSTM as a Transformer alternative in language modeling. xLSTM combines LSTM with modern techniques to tackle storage and decision-making issues, aiming to rival Transformers in predictive tasks.
The Illustrated Transformer
Jay Alammar's blog explores The Transformer model, highlighting its attention mechanism for faster training. It outperforms Google's NMT in some tasks, emphasizing parallelizability. The blog simplifies components like self-attention and multi-headed attention for better understanding.
Math Behind Transformers and LLMs
This post introduces transformers and large language models, focusing on OpenGPT-X and transformer architecture. It explains language models, training processes, computational demands, GPU usage, and the superiority of transformers in NLP.
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
The paper introduces Test-Time Training (TTT) layers for sequence modeling, featuring linear complexity and self-supervised learning for training on test sequences. TTT-Linear outperforms Transformer, while TTT-MLP shows potential for long contexts.
Transformer Layers as Painters
The study "Transformer Layers as Painters" by Qi Sun et al. delves into transformer models, showcasing layer impact variations and potential for model optimization through strategic layer adjustments.
- Some professionals in economics and finance see real value in LSTM models over transformers.
- Questions arise about the general superiority of transformer models compared to non-deep learning models.
- There is curiosity about the applicability of xLSTM to financial trading and event classification in time series.
- Skepticism exists regarding the effectiveness of published time series forecasting methods in unpredictable domains like the stock market.
- Some comments humorously misinterpret or criticize the potential misuse of the model in financial contexts.
Prominence, yes. But are they generally better than non-deep learning models? My understanding was that this is not the case, but I don't follow this field closely.
https://deepmind.google/discover/blog/graphcast-ai-model-for...