Bookmark
Using Let's Encrypt for internal servers
https://blog.heckel.io/2018/08/05/issuing-lets-encrypt-certificates-for-65000-internal-servers/, posted 2021 by peter in development howto networking security toread
But while there are many tools to automatically renew certificates for publicly available webservers (certbot, simp_le, I wrote about how to do that 3 years back), it's hard to find any useful information about how to issue certificates for internal non Internet facing servers and/or devices with Let's Encrypt.
Bookmark
Learning Containers From The Bottom Up
https://iximiuz.com/en/posts/container-learning-path/, posted 2021 by peter in deployment development learning linux toread virtualization
This article is not an attempt to explain containers in one go. Instead, it's a front-page for my multi-year study of the domain. It outlines the said learning path and then walks you through it, pointing to more in-depth write-ups on this same blog.
Mastering containers is no simple task, so take your time, and don't skip the hands-on parts!
Bookmark
Conventional Commits
https://www.conventionalcommits.org/, posted 2021 by peter in development documentation git reference versioncontrol
The Conventional Commits specification is a lightweight convention on top of commit messages. It provides an easy set of rules for creating an explicit commit history; which makes it easier to write automated tools on top of. This convention dovetails with SemVer, by describing the features, fixes, and breaking changes made in commit messages.
Bookmark
The Wrong Abstraction
https://sandimetz.com/blog/2016/1/20/the-wrong-abstraction, posted 2021 by peter in development
The moral of this story? Don't get trapped by the sunk cost fallacy. If you find yourself passing parameters and adding conditional paths through shared code, the abstraction is incorrect. It may have been right to begin with, but that day has passed. Once an abstraction is proved wrong the best strategy is to re-introduce duplication and let it show you what's right. Although it occasionally makes sense to accumulate a few conditionals to gain insight into what's going on, you'll suffer less pain if you abandon the wrong abstraction sooner rather than later.
When the abstraction is wrong, the fastest way forward is back. This is not retreat, it's advance in a better direction. Do it. You'll improve your own life, and the lives of all who follow.
Bookmark
Faster Maven builds
https://blog.frankel.ch/faster-maven-builds/1/, posted 2021 by peter in development howto java performance reference
I want to detail some techniques you can leverage to make your Maven builds faster in this post. The following post will focus on how to do the same inside of Docker.
Bookmark
Typesense | Fast, typo-tolerant open source search engine
https://typesense.org/, posted 2021 by peter in development free opensource search software
Typesense is an open source, typo tolerant search engine that is optimized for instant sub-50ms searches, while providing an intuitive developer experience.
Bookmark
A list of free data matching and record linkage software
https://github.com/J535D165/data-matching-software, posted 2021 by peter in development free list nlp opensource software
This is a list of (Fuzzy) Data Matching software. The software in this list is open source and/or freely available.
The term data matching is used to indicate the procedure of bringing together information from two or more records that are believed to belong to the same entity. Data matching has two applications: (1) to match data across multiple datasets (linkage) and (2) to match data within a dataset (deduplication). See the Wikipedia page about data matching for more information.
Similar terms: record linkage, data matching, deduplication, fuzzy matching, entity resolution
Bookmark
How to calculate the alignment between BERT and spaCy tokens effectively and robustly
https://gist.github.com/tamuhey/af6cbb44a703423556c32798e1e1b704, posted 2021 by peter in development free language nlp opensource software toread
Suppose we want to combine a BERT-based named entity recognition (NER) model with a rule-based NER model built on top of spaCy. Although BERT's NER exhibits extremely high performance, it is usually combined with rule-based approaches for practical purposes. In such cases, what often bothers us is that tokens of spaCy and BERT are different, even if the input sentences are the same. For example, let's say the input sentence is "John Johanson 's house"; BERT tokenizes this sentence like
["john", "johan", "##son", "'", "s", "house"]
and spaCy tokenizes it like["John", "Johanson", "'s", "house"]
. To combine the outputs, we need to calculate the correspondence between the two different token sequences. This correspondence is the "alignment".
Bookmark
h-feed - Microformats Wiki
https://microformats.org/wiki/h-feed, posted 2021 by peter in development html rss standard webdesign
h-feed is a simple, open format for publishing a stream or feed of h-entry posts, like complete posts on a home page or archive pages, or summaries or other brief lists of posts. h-feed is one of several open microformat draft standards suitable for embedding data in HTML.
Bookmark
Coreutils Gotchas
https://www.pixelbeat.org/docs/coreutils-gotchas.html, posted 2021 by peter in development linux list reference shell
We make very careful considerations about the interface and operation of the GNU coreutils, but unfortunately due to backwards compatibility reasons, some behaviours or defaults of these utilities can be confusing.
This information will continue to be updated and overlaps somewhat with the coreutils FAQ, with this list focusing on less frequent potential issues.
|< First < Previous 29–38 (528) Next > Last >|