% som: a self-organizing map using a circular ring network topology.
% w = som(m,init_n,dec_n,init_d,Training,File)
% returns w, the m-by-n matrix of final weight vectors for the nodes of a
% circular ring topology (the time, learning rate, distance and error at
% each iteration are written to File; see below). Given Training, a p-by-n
% matrix of input vectors of dimension n, the network w is a 2-layer net
% constructed as an m-by-n matrix such that node i's weight vector is
% w(i,1:n). At each presentation of an input vector i(l), the winner w(j)
% is found, and the winner and its neighbors (according to the current
% distance function) are updated. Uses a circular ring topology such that
% node i is adjacent to node j if i = j+1 or i = j-1, and node 1 is
% adjacent to node m.
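% e.g., with m = 5 nodes and a neighborhood distance of 1, the neighborhood
% of node 1 (the node itself included) is nodes {5,1,2}.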
%
% NOTE: this version of som expects the input vectors to be of unit length
% and normalizes the weight vectors after each presentation such that they
% lie on the unit circle. The winner w(j) is the node whose weight vector
% has the smallest squared Euclidean distance to the presented input.
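% (For unit-length vectors x and w, ||x - w||^2 = 2 - 2*(x*w'), so
% minimizing the squared Euclidean distance over the nodes is equivalent
% to maximizing the dot product x*w'.)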
%
% Additionally, writes in File (binary format) the weight vectors for
% each time t=0..termination. The File has the following format:
% The header:
% - a 1-by-3 matrix [p,n,m], written with fwrite's default uint8 precision,
% where p is the number of input vectors, n is the dimension of the
% input vectors and m is the number of output nodes.
% - a p-by-n matrix defining the input vectors (written as doubles, as is
% everything that follows)
% For each t=0..termination
% - a 1-by-5 matrix [t,nt,dt,e,j] where t is the time, nt is the learning
% rate, dt is the neighborhood distance, e is the squared Euclidean
% distance of the selected input from the winner, and j is the winning node
% - a m-by-n matrix w, the weight vectors at time t
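%
% e.g., a minimal sketch of reading the file back (not part of som itself;
% it simply mirrors the fwrite calls in the code below):
%   fid = fopen(File,'rb');
%   hdr = fread(fid,3,'uint8');            % [p,n,m], fwrite's default precision
%   p = hdr(1); n = hdr(2); m = hdr(3);
%   Training = fread(fid,[p,n],'double');  % the input vectors
%   while true
%     rec = fread(fid,5,'double');         % [t,nt,dt,e,j]
%     if isempty(rec), break; end
%     w = fread(fid,[m,n],'double');       % the weights at time rec(1)
%   end
%   fclose(fid);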
%
% Input:
% m - the number of output nodes
% init_n - the learning rate n at time t=0
% dec_n - the amount to decrease n after each epoch
% init_d - the neighborhood distance d at time t=0
% Training - the p-by-n matrix of input vectors
% File - the name of a file to write the weights at each time to. This
% file will be written in binary.
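%
% Example (hypothetical parameter values; assumes Training holds unit-length
% 2-dimensional input vectors):
%   w = som(25, 0.5, 0.005, 12, Training, 'som_weights.bin');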
%
% NOTES:
% - Expects a function dt = distance(t) to be defined on the path or
% in the local directory that determines the neighborhood distance dt
% given the current time t (a commented sketch of one possible schedule
% appears at the end of these notes).
% - This SOM algorithm has been left as general as possible, but for this
% problem description (a unit circle) the function norm_circle expects
% 2-dimensional inputs (a commented sketch of norm_circle follows the
% neighbors sub procedure below).
% - Under the assumption that the number of nodes in the output layer is
% relatively small, som uses the following weight update rule:
% w = w + nt * (il-w) .* nj
% where the rows of nj are 1s for the winner and its neighbors and 0s
% otherwise, so that the effect of the rule is
% wj = wj + nt*(il-wj) if j is the winner or one of its neighbors
% wj = wj otherwise
% By doing so, the neighbors of the winner do not have to be removed from
% w, updated, and reinserted. If, however, the number of nodes is quite
% large, it may be less computationally expensive to only calculate
% il-w for the neighbor nodes (see the commented sketch below).
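%
% A minimal sketch of that large-m alternative (hypothetical; not what
% this implementation does) would update only the rows of w in the
% winner's neighborhood:
%   idx = find(neighbors(j,m,dt)); % rows of the winner and its neighbors
%   w(idx,:) = norm_circle(w(idx,:) + nt * (il(idx,:) - w(idx,:)));
%
% A minimal sketch of a distance(t) schedule (hypothetical constants; the
% schedule actually used for this problem may differ):
%   function dt = distance(t)
%   dt = max(0, 10 - floor(t/2500)); % start at 10, shrink by 1 every 2500 steps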
%
% Author: Dale Patterson
% $Version: 1.4.3 $ $Date: 3.30.06 $
%
function w = som(m,init_n,dec_n,init_d,Training,File)
nt = init_n; % current learning rate
dt = init_d; % current neighborhood distance
% attempt to open the output file
[fid,msg] = fopen(File,'wb');
if fid == -1
error('project2:cannotOpenReportFile','%s',msg);
end
% generate the initial weight vectors
% w is an m-by-n matrix such that w(i,1:n) is the weight vector of node i
% weights are generated randomly in the range [-1,1] and normalized
[p,n] = size(Training); % get # of samples and dimension
w = norm_circle(-1 + 2.*rand(m,n)); % and generate w
% since we're using matrix subtraction, we'll make a multidimensional array
% I of size m-by-n-by-p where each I(:,:,i) is an m-by-n matrix whose rows
% all equal Training(i,:), that is, each input vector is repeated m times
I = ones(m,n,p);
for i=1:p
I(:,:,i) = ones(m,1) * Training(i,:);
end
% write network parameters, input and initial weight vectors
t = 0; % start at time t=0
err = Inf; % and err as infinity
fwrite(fid,[p,n,m]); % header, fwrite's default uint8 precision (assumes p,n,m <= 255)
fwrite(fid,Training,'double'); % use double for precision
fwrite(fid,[t,nt,dt,err,NaN],'double');
fwrite(fid,w,'double');
% loop until the computational bounds are exceeded; we'll terminate after
% 120 epochs (for this problem, that means 20 epochs at d=0 and n=0.005)
while t < p*120
% permute the input so that there's a different order for each epoch
i = randperm(p); % i is a random permutation of 1..p, use i to index I
err = 0;
% loop over the input (an epoch)
for l=1:p
% update the time
t = t + 1;
il = I(:,:,i(l)); % select an input sample
% compute the difference between the input and each node's weight vector
dist = il-w; % m-by-n matrix of difference vectors
[v,j] = min(sum(dist.^2,2)); % j: winner index, v: its sq. euc. dist.
nj = neighbors(j,m,dt) * ones(1,n); % nj: rows of 1s for winner/neighbors, 0s otherwise
err = err + v; % tally the error
% update weight of winner and neighbors, and normalize
w = norm_circle(w + nt * dist .* nj);
% write period parameters and weight vectors to file
fwrite(fid,[t,nt,dt,v,j],'double');
fwrite(fid,w,'double');
end
% average the squared error over the epoch (not returned; per-step error is in File)
err = err / p;
% update distance and learning rate
if nt - dec_n > 0
nt = nt - dec_n;
end
dt = distance(t);
end
% clean up
fclose(fid); % and close the file
%%%% END som %%%%
%%%% BEGIN SUB PROCEDURES %%%%
% neighbors calculates the neighbors of a given node in a circular ring topology.
% n = neighbors(index,numNodes,distance)
% returns n, a numNodes-by-1 column vector of 0s and 1s {0=not a neighbor,
% 1=neighbor}, marking the node at index and every node within the given
% distance of it in a circular ring topology
function n = neighbors(i,ns,d)
n = zeros(ns,1); % initialize a vector of zeros
n(mod((-d:d)+(i-1),ns)+1) = 1; % and set to 1 each node within d of i (including i itself)
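% NOTE: norm_circle is expected to be available on the path or as a further
% sub procedure (not shown here). A minimal sketch, assuming it simply
% rescales each row (here a 2-D point) to unit Euclidean length so that it
% lies on the unit circle:
%   function W = norm_circle(W)
%   W = W ./ (sqrt(sum(W.^2,2)) * ones(1,size(W,2)));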