[B! scikit-learn] xefのブックマーク

Speeding up scikit-learn workflow using a high-performance Go proxy. — ioloop.io

xef 2016/12/20

リンク

Machine Learning Recipes with Josh Gordon

Josh Gordon is cooking up some Machine Learning models from scratch. Learn how to use open source libraries to create ML recipes that can be adapted to a wid...

xef 2016/04/18

google developersの短く見れるレクチャー。しかし動画で使ってるの2.x系…

リンク

megaman: Manifold Learning with Millions of points

Manifold Learning is a class of algorithms seeking a low-dimensional non-linear representation of high-dimensional data. Thus manifold learning algorithms are, at least in theory, most applicable to high-dimensional data and sample sizes to enable accurate estimation of the manifold. Despite this, most existing manifold learning implementations are not particularly scala ble. Here we present a Pyth

xef 2016/03/10

リンク

[1407.7502] Understanding Random Forests: From Theory to Practice

Data analysis and machine learning have become an integrative part of the modern scientific methodology, offering automated procedures for the prediction of a phenomenon based on past observations, unraveling underlying patterns in data and providing insights about the probl em. Yet, caution should avoid using machine learning as a black-box tool, but rather consider it as a methodology, with a rat

xef 2014/11/01

リンク

Kevin McAlear » Building Hater News | Thoughts

Hater News Haterz gonna hate. But now you know who the haterz are. I wanted to use data science / machine learning to identify and rank haters by their “hater” level throughout the internet. I started with Hacker News ( check out Hater News ) and I wanted to explain the how and what I’ve done so far. This post is long and detailed but I’ve tried to explain and post my code for how I’ve built out t

xef 2014/10/28

リンク

Entry Point Data

In this short tutorial I want to provide a short overview of some of my favorite Python tools for common procedures as entry points for general pattern classification and machine learning tasks, and various other data analyses. Sections Sections Installing Python packages About the dataset Downloading and saving CSV data files from the web Reading in a dataset from a CSV file Visualizating of a da

xef 2014/06/27

リンク

Optimizing Memory Usage Of Scikit-Learn Models Using Succinct Tries

Extract Summit returns! Join global data leaders in Austin (Sept 24–25) & Dublin (Nov 5–6) for expert-led scraping sessions & networking.

xef 2014/04/04

scikit-learn

リンク

ŷhat | Predicting customer churn with scikit-learn

Customer Churn "Churn Rate" is a business term describing the rate at which customers leave or cease paying for a product or service. It's a critical figure in many businesses, as it's often the case that acquiring new customers is a lot more costly than retaining existing ones (in some cases, 5 to 20 times more expensive). Understanding what keeps customers engaged, therefore, is incredibly valua

xef 2014/03/24

scikit-learn

リンク

多層パーセプトロンで手書き数字認識 - 人工知能に関する断創録

多層パーセプトロンが収束する様子（2014/1/23）の続き。数字認識は前にニューラルネットによるパターン認識（2005/5/5）をJavaで作りましたが今回はPythonです。今回は、多層パーセプトロンを用いて手書き数字を認識するタスクを実験します。今回からscikit-learnというPythonの機械学習ライブラリを活用しています。ただ、scikit-learnには多層パーセプトロンの正式な実装はない*1ため多層パーセプトロンのスクリプトはオリジナルです。今回から比較的大きなデータを扱うためなるべく高速に動作し、かつPRMLと変数名を合わせることで理解しやすいようにしました。 digitsデータ手書き数字データは、MNISTというデータが有名です。PRMLの付録Aでも紹介されています。今回はいきなりMNISTではなく、scikit-learnのdigitsというより単純なデータセ

xef 2014/02/02

リンク

sklearnで手書き文字認識 - 忘れないようにメモっとく

Python Advent Calendar 2013 - Adventar、20日目の記事です。 pythonの機械学習ライブラリのsklearnが便利。ドキュメントも充実してるし、機械学習のアルゴリズムに詳しくなくても手軽に使えるので、動かしながら勉強できる。 ↓ sklearnでできること(一部) データセット sklearn.datasets クラスタリング sklearn.cluster クロスバリデーション sklearn.cross_validation 行列の分解(PCA等) sklearn.decomposition アンサンブル学習 sklearn.ensem ble 線形モデル sklearn.linear_model モデル評価 sklearn.metrics 近傍法 sklearn.neighbors SVM sklearn.svm 決定木 sklearn.tre

xef 2013/12/23

scikit-learn

リンク

Kernel Density Estimation in Python | Pythonic Perambulations

Last week Michael Lerner posted a nice explanation of the relationship between histograms and kernel density estimation (KDE). I've made some attempts in this direction before (both in the scikit-learn documentation and in our upcoming textbook), but Michael's use of interactive javascript widgets makes the relationship extremely intuitive. I had been planning to write a similar post on the theory

xef 2013/12/03

リンク

機械学習ライブラリ scikit-learn で簡単ツイート分類 | ぱろすけのメモ帳

皆様こんにちは。今日も元気に自分を見つめ直していますか？自分を見つめ直す手段として、過去の自分の発言を見返すことは有用だと考えられます。たとえば、Twitter を使っている方ならば、その過去ログを用いることが考えられるでしょう。Twitter の過去ログはライフログとして機能し、それを用いることにより、過去の出来事のみならず、考え方、感じ方、ふとした着想、などなどを読み取ることができます。しかしながら、過去のツイートを全部見返すのは非常に面倒ですね。たとえば僕のアカウントはトータルで4万ツイート以上あります。これは非常にめんどい。Twitter は情報量多くてだるいよね問題については、従来「再生核 Hilbert 空間を用い、脱ベイズ確率主義に立脚したあまりだるくないツイッター」などの試みが行われてきましたが、今なおその面倒さを軽減する手段については十分とはいえません。本記事では、過去の

xef 2013/11/13

リンク

GitHub - dclambert/pyensemble: An implementation of Caruana et al's Ensemble Selection algorithm in Python, based on scikit-learn

xef 2013/11/10

リンク

Machine Learning with scikit-learn

Overview Basic concepts of machine learning Introduction to scikit-learn Some useful algorithms Selecting a model Working with text data scikit-learn Collection of machine learning algorithms and tools in Python. BSD Licensed, used in academia and industry (Spotify, bit.ly, Evernote). ~20 core developers. Take pride in good code and documentation. We want YOU to participate! Supervised learning Tr

xef 2013/10/07

リンク

scikit-learnでサポートベクトル回帰、及びそのパラメーター推計 with クロスバリデーションやってみる - My Life as a Mock Quant

サポートベクトル回帰(Support Vector Regression, SVR)の理論が大体脳内整理出来たので、実践もしたいぞと、そしてちょいとpythonを使う別件があるので、慣れの意味も込めてR言語ではなくpythonとその機械学習ライブラリであるscikit-learnを使ってやるぞとそういうことです。 scikit-learn自体のインストールはこの記事の最下部にある日本語のLINKを見れば良いと思う。俺はpip使ってインストールしたような気がするけど、なにぶんずいぶんと昔なので忘れてしまった。pipで入れるなら pip install scikit-learnでOK。裏でコンパイルが走っていたような記憶があるので、C++のコンパイラいれておかないとだめかも。 windows用のバイナリファイルだと Scikit Learn - Browse Files at SourceF

xef 2013/08/26

scikit-learn

リンク

Gaël Varoquaux » Blog Archive » Scikit-learn 0.14 release: features and benchmarks

I have tagged and released the scikit-learn 0.14 release yesterday evening, after more than 6 months of heavy development from the team. I would like to give a quick overview of the highlights of this release in terms of features but also in term of performance. Indeed, the scikit-learn developers believe that performance matters and strive to be fast and efficient on fairly big datasets. I will

xef 2013/08/09

scikit-learn

リンク

Scikit-learn sprint and 0.14 release candidate (Update: binaries available :)

Yesterday a week-long scikit-learn coding sprint in Paris ended. And let me just say: a week is pretty long for a sprint. I think most of us were pretty exhausted in the end. But we put together a release candidate for 0.14 that Gael Varoquaux tagged last night. You can install it via: pip install -U https://github.com/scikit-learn/scikit-learn/archive/0.14a1.zip There are also tarballs on github

xef 2013/07/30

scikit-learn

リンク

Combining Scikit-Learn and NTLK

A collection of sloppy snippets for scientific computing and data visualization in Python. In Chapter 6 of the book Natural Language Processing with Python there is a nice example where is showed how to train and test a Naive Bayes classifier that can identify the dialogue act types of instant messages. Th classifier is trained on the NPS Chat Corpus which consists of over 10,000 posts from instan

xef 2013/07/24

リンク

pythonの機械学習ライブラリscikit-learnの紹介 - 唯物是真 @Scaled_Wurm

scikit-learn(sklearn)の日本語の入門記事があんまりないなーと思って書きました。どちらかっていうとよく使う機能の紹介的な感じです。英語が読める方は公式のチュートリアルがおすすめです。 scikit-learnとは？ scikit-learnはオープンソースの機械学習ライブラリで、分類や回帰、クラスタリングなどの機能が実装されています。また様々な評価尺度やクロスバリデーション、パラメータのグリッドサーチなどの痒いところに手が届く機能もあります。インストール scikit-learnの他にもnumpyとかscipyとかが必要です。 Windows 64 bit版の人は以下のURLに色々なインストーラーがおいてあるのでおすすめ Python Extension Packages for Windows - Christoph Gohlke その他の人は以下のURLを見て