[B! merge sort] yassのブックマーク

yass id:yass

merge sortに関するyassのブックマーク (13)

GitHub - cowtowncoder/java-merge-sort: Basic stand-alone disk-based N-way merge sort component for Java
yass 2018/11/20
java

sort

merge sort
リンク
MySQLのfilesortは何ソートで行われているのか - $shibayu36->blog;
最近、CourseraのArgorithms, Part1という講義を受けている。そこでソートの講義を受けて、そういえばMySQLのORDER BYでfilesortになったときってどのソートが使われているのだろうと気になってきたので調べてみた。調べてみると非常に難解で、結局いまいち分からなかったが、今の段階の調べた内容をひとまず書いておく。MySQLのコードを読んだのも初めてで、しかもちゃんと読み解くことができなかったので、情報が間違っている可能性も非常に高い。間違ってたら指摘してもらえるとうれしいです。調査結果最初に調査結果を書いておく。たぶんこれは非常に単純化したもので、詳しく見るともっといろいろチューニングされてそう。 sort_buffer_size以内のメモリ量でソートが可能な場合、メモリ内でのみソートされるソートにsort_buffer_size以上のメモリが必要な場
yass 2017/04/12
MySQL

sort

merge sort
リンク
[KMP77] を読んでみた - d.y.d.
17:53 14/12/22 ソートの逆流れクイックソートってあるじゃないですか、クイックソート。配列、たとえば [4,2,1,7,0,6,5,3] があったときに、小さい方を左に、大きい方を右にまず適当に集める。この「小さい方」と「大きい方」への二分割を、いわゆる再帰的に、分かれたブロック両方で同じ事を繰り返していくと… なんと、小さい順に並んだ配列 [0,1,2,3,4,5,6,7] が出来上がるというアルゴリズムです。逆向き！このデータの流れを「逆向きに」見てみたい。つまり、ソートが終わった最終状態から話が始まります。しかも、さっきから説明なしで意味ありげにくっついていた、「入力配列で元々どの位置にあったか」を表す値に注目していきます。 0の上に[4]がくっついているのは、最初は値0は配列のインデックス[4]の位置にあった、ということを意味しています。（上のソート
yass 2017/01/03
sort

quick sort

merge sort

Algorithm
リンク
What is the difference in idea, design and code, between Apache Spark and Apache Hadoop?
Answer (1 of 17): When I was getting started with using Apache Spark, I had the same question. From everything I heard, it seemed as if Spark does the same things as Mapreduce but better and faster. But, as it turns out that’s not the case. A few resources (linked below) have helped me with that ...
yass 2015/11/22
hadoop

spark

comparison

mapreduce

merge sort
リンク
並列データベースシステムの概念と原理
3. 講義内容  序論 - 並列データベースの前に  並列処理の基礎   並列処理のTerminology 並列計算機アーキテクチャ  並列データベースのアーキテクチャ  データベース処理の並列化  結合処理の高速化     並列ハッシュ結合並列ソートパーティショニング手法多重結合や計算機間のデータ交換で発生する問題  MapReduceによる関係演算の並列処理 3 4. データベース開発の流れ  Coddの論文: 1970年     System RやIngres: 70年代中盤 Oracle, IBM DB2, Ingres: 80年代序盤並列データベースの隆盛: 80年代後半   A Relational Model of Data for Large Shared Data Banks, Communications of ACM 商用
yass 2014/02/02
MapReduce

database

join

sort

merge sort

partitioning
リンク
How MySQL executes ORDER BY
In last couple of weeks there has been a tide of ORDER/GROUP BY-related optimization bugs, where I was the fixer or the reviewer. This wasn’t an easy job because there is no sane description of how GROUP BY/ORDER BY handling is supposed to work. To figure it out, I had to write an explanation of how it works. The first part is about ORDER BY. Hopefully there will be subsequent parts that will show
yass 2013/10/19
" In a nutshell, filesort() does quicksort on chunks of data that fit into its memory and then uses mergesort approach to merge the chunks. / if the sorted data doesn’t fit into memory (i.e. there is more than one chunk), filesort uses a temporary file to store the chunks. "

mysql

sort

merge sort

sort_buffer_size

quick sort
リンク
[TECH] Algorithmic details of UNIX Sort command.
Algorithms, Theory, Spirituality, Life, Techno logy, Food and Workout : trying to sort these deterministically in $\Theta(1)$ time (constant time). I happened to look at the algorithmic details of UNIX Sort, a LINUX version of the classic UNIX sort is a part of GNU coreutils-6.9.90. This is classic example of the standard External R-Way merge , to sort a data of size N bytes with a main memory size
yass 2013/10/18
" This is classic example of the standard External R-Way merge , to sort a data of size N bytes with a main memory size of M so it creates N/M runs and merges R at a time, the number of passes through the data is log(N/M)/log(R) passes. "

sort

linux

merge sort
リンク
sinbadsoft.com
This domain may be for sale!
yass 2013/09/21
" first, split the file into small chunks that would fit in memory, load each chunk, sort it, and write it back on disk. Second, perform a k-way merge on all the sorted chunks to get the final result. "

sort

merge sort
リンク
MemtableSSTable_JP - Cassandra Wiki
Overview Cassandraの書き込みはまずコミットログ(Commit Log)に対して行われます。そしてColumnFamilyごとにMemtableと呼ばれる構造体に対して書き込まれます。Memtableは基本的にキーで参照可能なデータ行のライトバックキャッシュです。つまりライトスルーキャッシュと違ってSSTableとしてディスクに書き込まれる前に、Memtableが一杯になるまで書き込まれます。 Flushing MemtableをSSTableへ変換するプロセスをフラッシュ(flushing)と呼びます。JMX経由で(例えばnodetoolを使用して)手動でフラッシュを実行することも可能です。コミットログのリプレイ時間を短くするためにノードを再起動する前に行った方が良いでしょう。Memtableはキーでソートされ、シーケンシャルに書き出されます。したがって書き込みは超高速に
yass 2013/09/21
" Memtableのフラッシュ時のサイズと同じサイズから始まって、サイズが最大N倍になりながら階層的に形作 / 入力元となるSSTableはすべてキーでソートされているため、マージは効率良く行われランダムI/Oを必要としません "

cassandra

SSTable

sort

merge sort
リンク
GitHub - lemire/externalsortinginjava: External-Memory Sorting in Java
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
yass 2013/03/27
java

Algorithm

sort

merge sort
リンク
Ryan Marcus · UPenn
Ryan Marcus, assistant professor at the University of Pennsylvania. Using machine learning to build the next generation of data systems. ____ __ ___ / __ \__ ______ _____ / |/ /___ _____________ _______ / /_/ / / / / __ `/ __ \ / /|_/ / __ `/ ___/ ___/ / / / ___/ / _, _/ /_/ / /_/ / / / / / / / / /_/ / / / /__/ /_/ (__ ) /_/ |_|\__, /\__,_/_/ /_/ /_/ /_/\__,_/_/ \___/\__,_/____/ /____/ ___ __ ___
yass 2013/03/10
" I also needed to be able to query the k most frequently contacted contacts. / To solve this problem, I created an augmented binary tree which provides an insertion time of O(k log n), a search time of O( log n), and can find the top k contacts in O(1). "

data structure

top-k

sort

toread

augmented binary tree

merge sort

tree

binary tree
リンク
Intersecting Two Sorted Integer Arrays
An interesting probl em I've run to recently is the following (I tried to express it using Jon Bentley's convention): Input: Two sorted integer arrays A and B in increasing order and of different sizes N and M, respectively. Output: A sorted integer array C in increasing order that contains elements that appear in both A and B Contraints: No duplicates are allowed in C Example: For input A = {3,6,8
yass 2013/01/09
intersection

integer

sort

array

binary search

merge sort
リンク
stoimen's web log
Introduction Basically sorting algorithms can be divided into two main groups. Such based on comparisons and such that are not. I already posted about some of the algorithms of the first group. Insertion sort, bubble sort and Shell sort are based on the comparison model. The probl em with these three algorithms is that their complexity is O(n2) so they are very slow. So is it possible to sort a lis
yass 2012/03/06
sort

algorithm

merge sort
リンク
1