[B! delta encoding] yassのブックマーク

yass id:yass

delta encodingに関するyassのブックマーク (8)

Frame of Reference and Roaring Bitmaps
Search and analytics, data ingestion, and visualization – all at your fingertips.
yass 2016/06/11
integer

compression

delta encoding

bit packing

lucene

BitSet

bitmap index

roaringbitmap
リンク
http://www.vldb.org/pvldb/vol8/p1816-teller.pdf
yass 2015/10/12
facebook

compression

integer

float

xor

delta encoding

time series database
リンク
Efficient Data Storage for Analytics with Parquet 2.0 - Hadoop Summit 2014
Efficient Data Storage for Analytics with Parquet 2.0 - Hadoop Summit 2014 Apache Parquet is an open-source columnar storage format for efficient data storage and analytics. It provides efficient compression and encoding techniques that enable fast scans and queries of large datasets. Parquet 2.0 improves on these efficiencies through techniques like delta encoding, dictionary encoding, run-length
yass 2014/06/25
parquet

delta encoding

columnar storage

integer

compression

branch prediction
リンク
Compression encodings - Amazon Redshift
Amazon Redshift will no longer support the creation of new Python UDFs starting November 1, 2025. If you would like to use Python UDFs, create the UDFs prior to that date. Existing Python UDFs will continue to function as normal. For more information, see the blog post . Compression encodings A compression encoding specifies the type of compression that is applied to a column of data values as row
yass 2013/09/23
" if the column contains 10 integers in sequence from 1 to 10, the first will be stored as a 4-byte integer (plus a 1-byte flag), and the next 9 will each be stored as a byte with the value 1 / the full original value is stored, with a leading 1-byte flag. "

redshift

compression

delta encoding

integer

columnar storage
リンク
JAWS UG中央線 LT Redshift Compression Encodings（圧縮アルゴリズム）
隠れたデータベースの遅延原因を特定し、そのレスポンスの改善手法紹介 @ dbtech showcase Tokyo 2019
yass 2013/09/23
redshift

compression

RLE

delta encoding
リンク
Binary fingerprint compression - Lukáš Lalinský
yass 2013/09/07
" If I use XOR instead of subtraction for the delta encoding, I will still get high numbers, but they will usually not have many bits set. "

compression

integer

XOR

delta encoding

fingerprint
リンク
List::FrontCode - naoyaのはてなダイアリー
先日 Array::Gap という Variable Byte Codes による整列済み整数の圧縮の実装を作りました。(id:naoya:20080906:1220685978) 今日は Front Coding を使った同じような圧縮リストクラス、List::FrontCode を作ってみました。Front Coding は辞書式順に整列済みの文字列リストなどを圧縮する手法です。WEB+DB PRESS Vol.42 のアルゴリズム&データ構造の記事で PFI の岡野原さんによる解説があったので、それを参考に実装しました。 Front Coding Front Coding は http://www.hoge.jp http://www.hoge.jp/a.htm http://www.hoge.jp/index.htm http://www.fuga.com/ http://www.
yass 2013/04/21
front coding

encoding

compression

algorithm

string

diff

delta encoding
リンク
差分符号化 - Wikipedia
差分符号化（さぶんふごうか、英: Delta encoding）とは、データの格納や転送を完全なファイルとしてではなく、シーケンシャルなデータの差分の形式で行う方式である。特に変更履歴の保存を目的とする場合（ソフトウェアプロジェクトなど）、差分符号化は差分圧縮（英: Delta compression）とも呼ばれる。デルタ符号化、デルタ圧縮とも呼ばれるが、デルタ符号とは異なる。例えばUNIXのファイル比較ユーティリティである diff などで「差分」または「デルタ」を作成し、個別にファイルとして記録する。差分は一般に元のファイルよりも小さいので、差分符号化によってデータの冗長性を大幅に削減できる。一連の差分ファイルの方が各バージョンのそのままのファイル群よりも大幅に記録容量が節約できる。論理的観点から言えば、2つのデータの差分があれば、一方のデータからもう一方のデータを得ることができる
yass 2012/11/10
algorithm

delta encoding

encoding

integer
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx