[B! r] gandenのブックマーク

ganden id:ganden

rに関するgandenのブックマーク (58)

Selected R Packages, from RStudio
ganden 2017/05/20
data_science

dashboard

R

visualization
リンク
Exploratory
Exploratory Desktop provides a Simple and Easy-to-Use UI experience to access various data sources, clean and transf orm data, visualize and analyze data to gain deeper insights, communicate your discoveries with Notes, and monitor your business metrics with Dashboards.
ganden 2017/05/20
data

visualization

R

data_science
リンク
Running RStudio Workbench / RStudio Server Pro with a Proxy – RStudio Support
Overview If you are running RStudio Workbench (previously RStudio Server Pro) behind a proxy server, you need be sure to configure the proxy server so that it correctly handles all traffic to and from RStudio Workbench. Beyond the normal reverse proxy configuration you'd apply for any HTTP server application, you also need to to ensure that websockets are forwarded correctly between the proxy serv
ganden 2017/02/08
rstudio

r
リンク
モダンなRによるテキスト解析 - Qiita
概要すぐに使えるKNBCコーパスを対象に、モダンなRの書き方でテキスト解析したときのメモです。TF-IDFや共起頻度（ネットワーク作成）、LDAやGloVeまでをパッケージで実行しました。解析済みブログコーパス定義・設定最初に処理で利用するライブラリの読み込みや定数・関数の定義。 library(pacman) library(widyr) # 読み込むパッケージ SET_LOAD_PACKAGE <- c("tidyverse", "Rcpp", "chunked", "tidytext", "visNetwork", "textmineR", "Matrix", "topicmodels", "LDAvis", "text2vec") # コーパスファイルの設定 SET_CORPUS_FILE <- list( DOWNLOAD_URL = "http://nlp.ist.i.
ganden 2016/09/12
mecab

r

nlp
リンク
⭐️Rを使ったモデル構築の最善策を求めて: {dplyr} + {tidyr} + {broom} + {purrr}を使ったアプローチ - cucumber flesh
RStudioのチーフサイエンティスト、Hadley Wickham（ハドリー）が２月に行った講演のビデオがYouTubeに上がっていたので観た。 "Making Data Analysis Easier"というタイトルでの発表(スライドでは"Managing many models"になっているけど)で、ハドリー自身が考えている、データサイエンスに必要な可視化やモデリングを効率的に行うための手法について、彼の開発してきたパッケージを中心に説明している。 www.youtube.com 分かりやすく、具体例を交えた内容なので、是非YouTubeの動画を観てもらうのが良いと思うが、自分の頭を整理するためにもここでまとめておく。なお、発表スライドはクリエイティブ・コモンズライセンス3.0のもと、表示・非営利のラインセンスで再利用可能となっている。 Hadley Wickham (Chief S
ganden 2016/07/07
r

data_mining

machine_learning

dplyr
リンク
On ranger respect.unordered.factors | R-bloggers
ganden 2016/05/31
machine_learning

random_forest

r
リンク
deployr revolutionanalytics - Bing
https://blog.revolutionanalytics.com/2015/08/d…このページを翻訳 2015/08/17 · by Carl Nan, DeployR PM A new version of DeployR, the server-based framework that provides simple and secure R integration for application developers, is now available. (If you're new to DeployR, take a look at the DeployR Overview or download the white paper, Using DeployR to Solve the R Integration Probl em.) The following list
ganden 2015/12/05
development

api

r
リンク
GitHub - bmschmidt/wordVectors: An R package for creating and exploring word2vec and other word embedding models
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
ganden 2015/12/05
word2vec

r
リンク
Anaconda for R users
Anaconda is a popular open-source Python distribution that includes more than 200 packages for scientific computing and data science. Recently, the …
ganden 2015/11/05
r

python

anaconda
リンク
RPubs - Practical Machine Learning Project with XGBoost
Hide Comments (–) Share Hide Toolbars
ganden 2015/11/04
r

xgboost

data_mining

machine_learning
リンク
Intree: R package for randomforest interpretation
Intree: R package for randomforest interpretation 1. 森を見たい “Interpreting Tree Ensem bles with inTrees” inTrees package (by Houtao Deng) を紹介します第51回R勉強会＠東京（#TokyoR） 2. ランダムフォレスト学習データのランダムサブセットで構築した様々な決定木の集合（＝森）の予測結果を統合する  分類 → 多数決  回帰 → 平均 ALL DATA Random subset Random subset Random subset … 3. 特徴変数の重要度も評価できますどれだけ予測力に貢献しているかという情報をもとに特徴変数の重要度を評価する 4. ランダムフォレスト学習データのランダムサブセットで構築した様々な決定木の集合（＝
ganden 2015/10/14
r

random_forest
リンク
NMF: Algorithms and Framework for Nonnegative Matrix Factorization (NMF)
NMF: Algorithms and Framework for Nonnegative Matrix Factorization (NMF) Provides a framework to perform Non-negative Matrix Factorization (NMF). The package implements a set of already published algorithms and seeding methods, and provides a framework to test, develop and plug new/custom algorithms. Most of the built-in algorithms have been optimized in C++, and the main interface function provid
ganden 2015/09/24
nmf

cran

r
リンク
RでランダムフォレストやるならRboristかrangerか - 盆栽日記
最近Rにおけるランダムフォレストの高速な実装としてrangerパッケージが発表された。開発者が既存のランダムフォレスト実装パッケージと比較した論文をarxivに掲載している。 http://arxiv.org/pdf/1508.04409v1.pdf rangerは速い…のか？既存のランダムフォレスト実装としてrandomForest、randomForest、bigrf、randomForestSRC、Random Jungle、Rboristが比較されている。私が扱うデータはほとんどがサンプルサイズ>>特徴量数というデータなので、Table2とFigure4が比較結果として参考になる。 Table2ではサンプルサイズ100,000、特徴量数100というデータに対して各パッケージの処理速度とメモリ消費量を比較している。ざっとみた感じ高速なのは二値型の特徴量（dichotomous
ganden 2015/08/31
r

machine_learning

data_mining

random_forest
リンク
New package "dplyrr" – Utilities for comfortable use of dplyr with databases | R-bloggers
R-bloggers R news and tutorials contributed by hundreds of R bloggers [This article was first published on HOXO-M - anonymous data analyst group in Japan - , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. 1. Overview dplyr is the most powerful package fo
ganden 2015/08/06
r

dplyr
リンク
dplyrを使いこなす！基礎編 - Qiita
はじめに 4月ということで、新卒が入ってきたりRを使ったことないメンバーがJOINしたりしたので、超便利なdplyrの使い方を何回かに分けてまとめて行きます。 Rは知らないけど、SQLとか他のプログラミング言語はある程度やったことあるみたいな人向けです。 dplyrを使いこなす！シリーズ基礎編以外も書きましたので、↓からどうぞ。 dplyrを使いこなす！Window関数編 dplyrを使いこなす！JOIN編 dplyrとはデータフレームの操作に特化したパッケージです。 Rは基本的に処理速度はあまり早くないですが、dplyrはC++で書かれているのでかなり高速に動作します。ソースの可読性もよくなるので、宗教上の理由で禁止されている人以外は使うメリットは大きいです。処理可能なデータサイズの目安あくまでも個人の環境に強く依存した感覚値ですが、1000万行、100MBぐらいのデータサイ
ganden 2015/07/22
r
リンク
Rで解析：データの特徴を一気に確認。「GGally」パッケージ
「ggplot2」パッケージを利用して多変数の特徴をプロットすることができるパッケージの紹介です。データの解釈に非常に便利なパッケージかと思います。また、複数のggplotオブジェクトのプロットが可能な「ggmatrix」コマンドも収録されています。パッケージバージョンは2.1.2。実行コマンドはwindows 11のR version 4.1.2で確認しています。 #パッケージの読み込み library("GGally") ###データ例の作成##### n <- 50 TestData <- data.frame(Group = sample(pas
ganden 2015/07/22
r

visualization

ggplot
リンク
htmlwidgets for R - gallery
ganden 2015/07/18
r

visualization
リンク
Googleがリリースした「キャンペーンとKPIとの因果関係を推定する」Rパッケージ{CausalImpact}を試してみた - 渋谷駅前で働くデータサイエンティストのブログ
何気なくR-Bloggerのタイムラインを見ていたら、"CausalImpact: A new open-source package for estimating causal effects in time series | Google Open Source Blog"という記事がシェアされていたので見に行ってみたのでした。これはもう読んで字の如く「GoogleがキャンペーンがKPIにもたらす因果的影響を時系列から推定する」ためのRパッケージの話題で、その名も{CausalImpact}という。ということで、ちろっと触ってみたので簡単にレビューしてみようと思います。本当は色々試してみたかったんですが、ちょっと手元に良いデータがないのでヘルプの事例のみでご勘弁を。。。インストール追記 (Jan 29 2020) 現在はCRANからインストールできます。 install.pack
ganden 2014/09/19
data_science

r
リンク
PythonとRによるデータ分析環境の構築と機械学習によるデータ認識
This document introduces deep reinforcement learning and provides some examples of its applications. It begins with backgrounds on the history of deep learning and reinforcement learning. It then explains the concepts of reinforcement learning, deep learning, and deep reinforcement learning. Some example applications are controlling building sway, optimizing smart grids, and autonomous vehicles. T
ganden 2014/09/05
python

r

machine_learning
リンク
Release dplyr 0.2 · tidyverse/dplyr
ganden 2014/07/01
r
リンク
1 2 3 次のページ