[B! algorithms] fcicqのブックマーク

fcicq id:fcicq

algorithmsに関するfcicqのブックマーク (582)

LZ_XOR
fcicq 2022/08/10
xor len distance bytes. note lzham author. note shuffle and align?

algorithms

compression
リンク
Faster Inverse BWT
The BWT (Burrows Wheeler Transf orm) has long fascinated people for its ability to capture complex correlations with a very simple inverse transf orm. Unfortunately despite that inverse transf orm being very simple, it is also slow. I will briefly review the inverse BWT (i-BWT) and then look at ways to speed it up. Jump to the end for the punch line : speed results and source code. Let's briefly revi
fcicq 2022/08/02
ibwt

algorithms
リンク
Ask HN: What are some cool but obscure data structures you know about? | Hacker News
I'm very interested in what types of interesting data structures are out there HN. Totally your preference.I'll start: bloom filters. Lets you test if a value is definitely NOT in a list of pre-stored values (or POSSIBLY in a list - with adjustable probability that influences storage of the values.) Good use-case: routing. Say you have a list of 1 million IPs that are black listed. A trivial algor
fcicq 2022/07/23
algorithms

resources
リンク
Interval Tree - GeeksforGeeks
Consider a situation where we have a set of intervals and we need following operations to be implemented efficiently. 1) Add an interval 2) Remove an interval 3) Given an interval x, find if x overlaps with any of the existing intervals. Interval Tree: The idea is to augment a self-balancing Binary Search Tree (BST) like Red Black Tree, AVL Tree, etc to maintain set of intervals so that all operat
fcicq 2022/07/22
algorithms
リンク
Wavelet Tree - Wikipedia
A wavelet tree on the string "abracadabra". At each node the symbols of the string are projected onto two partitions of the alphabet, and a bitvector denotes to which partition each symbol belongs. Note that only the bitvectors are stored; the strings in the nodes are only for illustratory purposes. The Wavelet Tree is a succinct data structure to store strings in compressed space. It generalizes
fcicq 2022/06/02
see also csa/fm-index

algorithms
リンク
Changing std::sort at Google’s Scale and Beyond
TL;DR; We are changing std::sort in LLVM’s libcxx. That’s a long story of what it took us to get there and all possible consequences, bugs you might encounter with examples from open source. We provide some benchmarks, perspective, why we did this in the first place and what it cost us with exciting ideas from Hyrum’s Law to reinforcement learning. All changes went into open source and thus I can
fcicq 2022/04/21
have read. points: pivot (arxiv 1606.00484), sorting network, presortedness, simd block swap, nth element, weak ordering

algorithms

interesting

c++

library

***
リンク
Algorithms for Modern Hardware - Algorithmica
This is an upcoming high performance computing book titled “Algorithms for Modern Hardware” by Sergey Slotin. Its intended audience is everyone from performance engineers and practical algorithm researchers to undergraduate computer science students who have just finished an advanced algorithms course and want to learn more practical ways to speed up a program than by going from $O(n \log n)$ to $
fcicq 2022/03/08
algorithms

hardware

memory
リンク
Ribbon filter: Practically smaller than Bloom and Xor
What the research is: The Ribbon filter is a new data structure that is more space-efficient than the popular Bloom filters that are widely used for optimizing data retrieval. One of the ways that Bloom, and now Ribbon, filters solve real engineering probl ems is by providing smooth configurability unmatched by other filters. Bloom filters work by overapproximating a set of keys associated with som
fcicq 2022/01/14
algorithms
リンク
How does Audio Fingerprinting work
Posted by Sergiu Ciumac on June 12, 2020 · 23 mins read Audio Fingerprinting I have been developing the SoundFingerprinting open source project for the last ten years. One of the questions I often receive is “how does music recognition works?” For the library users, it is somewhat similar to a one-way hash function. You provide a file at the input, and after a certain number of conversions, you ge
fcicq 2022/01/03
neighborhood search

algorithms
リンク
google-research/scann at master · google-research/google-research
ScaNN (Scala ble Nearest Neighbors) is a method for efficient vector similarity search at scale. This code implements [1, 2], which includes search space pruning and quantization for Maximum Inner Product Search and also supports other distance functions such as Euclidean distance. The implementation is designed for x86 processors with AVX2 support. ScaNN achieves state-of-the-art performance on an
fcicq 2021/12/25
commercial ver: Vertex AI Matching Engine. image search: mobilenet v2 embedding

search

algorithms

python

machinelearning
リンク
Python言語による実務で使える100+の最適化問題 | opt100
はじめに本書は，筆者が長年書き溜めた様々な実務的な最適化問題についてまとめたものである．本書は，Jupyter Laboで記述されたものを自動的に変換したものであり，以下のサポートページで公開している．コードも一部公開しているが，ソースコードを保管した Github 自体はプライベートである．本を購入した人は，サポートページで公開していないプログラムを圧縮ファイルでダウンロードすることができる．ダウンロードしたファイルの解凍パスワードは<本に記述>である．作者のページ My HP 本書のサポートページ Support Page 出版社のページ Pythonによる実務で役立つ最適化問題100+ (1) ―グラフ理論と組合せ最適化への招待― Pythonによる実務で役立つ最適化問題100+ (2) ―割当・施設配置・在庫最適化・巡回セールスマン― Pythonによる実務で役立つ
fcicq 2021/12/14
python

reference

algorithms
リンク
GitHub - nadavrot/memset_benchmark: This repository contains high-performance implementations of memset and memcpy in assembly.
fcicq 2021/11/12
Prepare big array of 0x01010101 * c

interesting

algorithms
リンク
GitHub - madler/crcany: Compute any CRC, a bit at a time, a byte at a time, and a word at a time.
crcany is a suite of programs that generalize CRC calculations, and that generate C code to compute and combine CRCs efficiently. Any CRC can be computed given the set of parameters that describe it. Those parameters are provided in the form as used by Greg Cook's catalog of over one-hundred CRCs, found at https://reveng.sourceforge.io/crc-catalogue/all.htm . That set of parameters were first defi
fcicq 2021/10/12
algorithms
リンク
Index 1,600,000,000 Keys with Automata and Rust - Andrew Gallant's Blog
It turns out that finite state machines are useful for things other than expressing computation. Finite state machines can also be used to compactly represent ordered sets or maps of strings that can be searched very quickly. In this article, I will teach you about finite state machines as a data structure for representing ordered sets and maps. This includes introducing an implementation written
fcicq 2021/08/21
not AC. https://github.com/BurntSushi/fst

algorithms

rust
リンク
GitHub - rivo/duplo: Detect duplicate (or similar) images. Written in Go.
fcicq 2021/08/07
similar image search. Perceptual hash

golang

algorithms
リンク
https://cantrip.org/sortfast.html
fcicq 2021/07/17
swap_if for qsort, clang compiles to cmov

algorithms

compiler
リンク
Faster sorted array unions by reducing branches – Daniel Lemire's blog
fcicq 2021/07/15
part of merge sort

algorithms
リンク
Ribbon filter: practically smaller than Bloom and Xor
Filter data structures over-approximate a set of hashable keys, i.e. set membership queries may incorrectly come out positive. A filter with false positive rate $f \in (0,1]$ is known to require $\ge \log_2(1/f)$ bits per key. At least for larger $f \ge 2^{-4}$, existing practical filters require a space overhead of at least 20% with respect to this information-theoretic bound. We introduce the Ri
fcicq 2021/07/12
in rocksdb

algorithms
リンク
DataSketches |
A software library of stochastic streaming algorithms "A truly excellent example of theoretically-informed algorithm engineering" -- Graham Cormode The Business Challenge: Analyzing Big Data Quickly. In the analysis of big data there are often probl em queries that don’t scale because they require huge compute resources and time to generate exact results. Examples include count distinct, quantiles,
fcicq 2021/07/01
note former pig? stream / sketches

algorithms
リンク
Guide to making high-quality thumbnails
fcicq 2021/06/12
algorithms
リンク
1 2 3 4 5 6 7 8 9 10 次のページ