时间正则化矩阵分解_JupyterNotebook_Python

共11个文件

py：5个

ipynb：5个

md：1个

版权申诉

188 浏览量 2023-04-13 23:56:21 上传评论收藏 582KB ZIP 举报

时间正则化矩阵分解（Temporal Regularized Matrix Factorization, TRMF）是一种先进的数据分析技术，尤其在推荐系统、时间序列预测和信号处理等领域有着广泛应用。它结合了矩阵分解和时间序列模型，通过引入时间依赖项来捕捉数据随时间变化的动态模式。在矩阵分解中，我们通常有一个大矩阵$R$，它表示用户对物品的评分或交互情况。TRMF的目标是将这个矩阵分解为两个低秩矩阵$U$和$V$，使得$R \approx UV^T$。这有助于发现隐藏的用户兴趣和物品属性，并可以用于预测未观察到的数据。具体来说，TRMF模型的优化目标通常包括以下部分： 1. **重构误差**：这是基本的矩阵分解损失，即实际值与预测值之间的差异，通常用均方误差（MSE）来度量。 2. **时间平滑项**：为了捕捉时间上的连续性，模型引入了一个正则项，通常采用指数衰减函数，使相邻时间步的预测值尽可能接近。 3. **正则化项**：为了防止过拟合，模型还可能包含对$U$和$V$的范数约束，如L1或L2正则化。在Python中实现TRMF，Jupyter Notebook是一个理想的选择，因为它提供了交互式环境，可以方便地编写代码、运行实验和可视化结果。`trmf-master`文件夹很可能包含了完整的TRMF实现，包括数据预处理、模型训练、预测和评估等步骤。以下是可能的实现流程： 1. **数据预处理**：你需要加载和处理数据集，将时间序列数据转换为矩阵形式。 2. **参数设置**：确定矩阵分解的秩（rank），时间平滑项的衰减率（alpha），以及正则化参数（lambda）等超参数。 3. **模型构建**：使用适当的库（如`numpy`或`scikit-learn`的扩展）来实现TRMF模型。 4. **模型训练**：通过迭代优化算法（如交替最小二乘法或随机梯度下降）来学习$U$和$V$的系数。 5. **预测**：使用训练好的模型对未观测到的时间点进行预测。 6. **评估**：计算预测值与真实值的误差，评估模型性能，可能使用RMSE或其他指标。 7. **可视化**：可选择性地绘制预测值与实际值的对比图，以直观展示模型效果。在Jupyter Notebook中，你可以逐步完成这些步骤，并随时检查结果。此外，你还可以进行参数调优，探索不同时间窗口大小和正则化强度对模型性能的影响。总结来说，时间正则化矩阵分解是矩阵分解的一个增强版本，特别适用于处理具有时间动态性的数据。通过结合Jupyter Notebook和Python，我们可以高效地实现、测试和优化TRMF模型，为各种时间序列分析任务提供强大工具。

资源推荐

资源详情

资源评论

收起资源包目录

时间正则化矩阵分解_Jupyter Notebook_Python_下载.zip （11个子文件）

trmf-master

synthetic_data.py 2KB

experiments_synthetic.ipynb 204KB

experiments_electricity.ipynb 76KB

experiments_missings.ipynb 76KB

workbook.ipynb 400KB

Forecast.py 3KB

trmf.py 8KB

Metrics.py 458B

RollingCV.py 2KB

experiments_crypto.ipynb 52KB

README.md 5KB

# Temporal Regularized Matrix Factorization Project was inspired by the paper: Yu, H. F., Rao, N., & Dhillon, I. S. (2016). Temporal regularized matrix factorization for high-dimensional time series prediction. In Advances in neural information processing systems (pp. 847-855). Which can be found there: http://www.cs.utexas.edu/~rofuyu/papers/tr-mf-nips.pdf ## 1. Problem description We have N timeseries of length T which are presented by matrix Y. We want to factorize it <a href="https://www.codecogs.com/eqnedit.php?latex=$Y&space;=&space;F\times&space;X$" target="_blank"><img src="https://latex.codecogs.com/gif.latex?$Y&space;=&space;F\times&space;X$" title="$Y = F\times X$" /></a>. To solve this problem we will minimize: <a href="https://www.codecogs.com/eqnedit.php?latex=$$\min\limits_{F,X}\sum\limits_{(i,t)\in\Omega}\left(Y_{it}-f_i^Tx_t\right)^2+\lambda_fR_f(F)+\lambda_xR_x(X).$$" target="_blank"><img src="https://latex.codecogs.com/gif.latex?$$\min\limits_{F,X}\sum\limits_{(i,t)\in\Omega}\left(Y_{it}-f_i^Tx_t\right)^2+\lambda_fR_f(F)+\lambda_xR_x(X).$$" title="$$\min\limits_{F,X}\sum\limits_{(i,t)\in\Omega}\left(Y_{it}-f_i^Tx_t\right)^2+\lambda_fR_f(F)+\lambda_xR_x(X).$$" /></a> By doing that we will find latent embedding vectors for timeseries and latent temporal embeddings for timepoints. One can further use this embeddings to forecast new data or to impute missings. ## 2. Package description Package consists of: - trmf : time series modelling - synthetic_data : data generation for experiments additionaly: - Metrics : metrics for experiments and validation - Forecast : simple models for testing experiments - RollingCV : rolling cross-validation implementation For usage information use help(trmf) ## 3. Experiments In experiments_[something].ipynb you can find some experiments on the package: 1) experiments_synthetic.ipynb: testing trmf model against other simple model on synthetic data **Lags = {1}** |horizon| 1 | 5 | 10 | 20 | |------|------|------|------|------| | Naive | **0.105**/**0.138** | **0.151**/**0.2** | **0.175**/**0.23** | 0.38/**0.477** | | Mean | 1.0/1.136 | 1.0/1.114 | 1.0/1.094 | 1.0/1.079 | | AutoRegression | 0.107/0.142 | 0.16/0.215 | 0.2/0.275 | 0.42/0.536 | | TRMF | 0.172/0.218 | 0.155/0.227 | 0.197/0.261 | **0.368**/0.48 | **Lags = {1,7}** |horizon| 1 | 5 | 10 | 20 | |------|------|------|------|------| | Naive | 0.82/0.956 | 1.072/1.292 | 0.893/1.119 | 1.051/1.303 | | Mean | 1.0/1.176 | 1.0/1.219 | 1.0/1.236 | 1.0/1.259 | | AutoRegression | **0.503**/**0.581** | **0.496**/**0.599** | 0.572/0.717 | **0.86/1.107** | | TRMF | 0.515/0.612 | 0.498/0.603 | **0.565/0.704** | 0.87/1.117 | **Lags = {1,7,14,28}** |horizon| 1 | 5 | 10 | 20 | |------|------|------|------|------| | Naive | 1.012/1.191 | 0.97/1.18 | 0.968/1.202 | 0.917/1.162 | | Mean | 1.0/1.164 | 1.0/1.218 | 1.0/1.206 | 1.0/1.197 | | AutoRegression | **0.618**/0.733 | **0.506**/**0.648** | **0.567**/**0.715** | 0.619/0.755 | | TRMF | 0.633/**0.662** | 0.544/0.676 | 0.578/0.726 | **0.582**/**0.72** | 2) experiments_electricity.ipynb: testing trmf model against other simple model on electricity data | horizon | 1 | 5 | 10 | 20 | |------|------|------|------|------| | Naive | **0.344/0.5** | 0.688/0.951 | 1.091/1.429 | 1.363/1.73 | | Mean | 1.0/1.19 | 1.0/1.201 | 1.0/1.204 | 1.0/1.188 | | AutoRegression | 0.427/0.557 | **0.612/0.831** | **0.627/0.876** | **0.58**/0.802 | | TRMF | 0.639/0.828 | 0.727/0.95 | 0.681/0.936 | 0.584/**0.799** | 3) experiments_crypto.ipynb: testing trmf model against other simple model on crypto-currency data | horizon | 1 | 5 | 10 | 20 | |------|------|------|------|------| | Naive | **0.158/0.36** | **0.228/0.574** | **0.29/0.644** | **0.347/0.842** | | Mean | 1.0/1.293 | 1.0/1.265 | 1.0/1.24 | 1.0/1.322 | | AutoRegression | 0.168/0.368 | 0.258/0.6 | 0.369/0.792 | 0.528/1.309 | | TRMF | 0.233/0.437 | 0.273/0.619 | 0.332/0.668 | 0.429/0.957 | 4) experiments_missings.ipynb: testing trmf model against other simple model on missing data imputation | missings | 5% | 10% | 25% | |----------|----|-----|-----| | Naive | 0.367/0.574 | 0.373/0.584 | 0.391/0.613 | | Mean | 129.506/150.367 | 108.291/125.712 | 89.242/103.586 | | TRMF | **0.359/0.516** | **0.36/0.519** | **0.361/0.52** | More details in notebooks. ## 4. Conclusion 1) TRMF model needs additional regularization on matrix W (sum of the row elements must be close to one). Otherwise, predictions for long periods will be unstable; 2) Every timeseries is better to be normalized before using TRMF; 3) TRMF is good on data imputation; 4) TRMF is good when data has missings. ## 5. Plan 1) Article analysis // done 2) Synthetic Data Generator // done 3) Basic realization of trmf with gradient descent // done 4) Documentation and help functions // done 5) Experiments on synthetic data (vs other models) // done 6) Rolling CV functionality // done 7) Experiments on electricity data (vs other models) // done 8) CryptoCurrency forecasting (vs other models) // done 9) Missing data handling // done 10) Missing data imputation experiments (vs other models) // done

评论收藏

内容反馈

版权申诉