In mathematics, the Wasserstein distance or Kantorovich–Rubinstein metric is a distance function defined between probability distributions on a given metric space M.
Intuitively, if each distribution is viewed as a unit amount of "dirt" piled on M, the metric is the minimum "cost" of turning one pile into the other, which is assumed to be the amount of dirt that needs to be moved times the mean distance it has to be moved. Because of this analogy, the metric is known in computer science as the earth mover's distance.
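For reference, the intuition above corresponds to the standard Kantorovich formulation: the p-th Wasserstein distance between two probability measures μ and ν on M with ground metric d is

```latex
W_p(\mu, \nu) = \left( \inf_{\gamma \in \Gamma(\mu, \nu)}
  \int_{M \times M} d(x, y)^p \, \mathrm{d}\gamma(x, y) \right)^{1/p}
```

where Γ(μ, ν) is the set of all couplings of μ and ν, i.e. joint measures on M × M whose marginals are μ and ν. A coupling γ plays the role of a transport plan, with γ(x, y) recording how much "dirt" moves from x to y; the p = 1 case is exactly the earth mover's distance described above.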
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
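To make the "reader" half of the Retriever-Reader framework concrete, here is a minimal sketch using a SQuAD-fine-tuned BERT-family model through the Hugging Face pipeline API; the model name and texts are illustrative choices, not taken from the paper:

```python
# Minimal sketch of the reader stage: extract an answer span from a passage.
# The checkpoint name is an illustrative SQuAD-fine-tuned model choice.
from transformers import pipeline

reader = pipeline(
    "question-answering",
    model="distilbert-base-cased-distilled-squad",
)

# In a full system, a retriever would supply this passage from an index.
context = (
    "The Retriever-Reader framework first retrieves candidate passages "
    "from a corpus, then a reader model extracts an answer span from them."
)
result = reader(question="What does the reader model do?", context=context)
print(result["answer"], result["score"])
```

The scaling difficulty mentioned above stems largely from this stage: every retrieved passage must pass through the transformer reader at query time.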
We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.
Kedro versioned datasets can be mixed with incremental and partitioned datasets. (Unsure what Kedro is? Check out this post.) This was a question presented to…
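For background on the partitioned side of that mix, here is a minimal sketch of loading a partitioned dataset through Kedro's Python API; the import paths assume kedro-datasets ≥ 2.0, and the directory layout is hypothetical:

```python
# Minimal sketch: one partition per CSV file under a hypothetical directory.
from kedro_datasets.pandas import CSVDataset
from kedro_datasets.partitions import PartitionedDataset

partitioned = PartitionedDataset(
    path="data/01_raw/orders",   # hypothetical location
    dataset=CSVDataset,          # dataset type used for each partition
    filename_suffix=".csv",
)

# load() returns a dict of partition id -> lazy load callable,
# so only the partitions you actually call get read from disk.
for partition_id, load_partition in partitioned.load().items():
    df = load_partition()
    print(partition_id, len(df))
```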
In previous articles, we explored how Snowpark Container Services can open doors to a complete data stack running solely on Snowflake (here) and showcased all essential tools Snowflake provides to achieve this (here). Now, it’s time to dive into the practical side of things. This article will guide you through a step-by-step implementation of running dbt in Snowpark Container Services, covering everything from setup and containerisation all the way to scheduling and monitoring. If you’re trying to create a simple containerised dbt setup, this guide will help you put all theory into action!
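Before diving into the steps, it may help to see the shape of the end result: a minimal sketch of a container entrypoint that invokes dbt programmatically, assuming dbt-core ≥ 1.5 (which exposes dbtRunner); the /app paths are illustrative, not taken from the article:

```python
# Minimal entrypoint sketch: run dbt inside the container, surface failures.
# Assumes dbt-core >= 1.5; project/profile paths are illustrative.
from dbt.cli.main import dbtRunner, dbtRunnerResult

def main() -> None:
    runner = dbtRunner()
    # Programmatic equivalent of `dbt run --project-dir /app --profiles-dir /app`.
    result: dbtRunnerResult = runner.invoke(
        ["run", "--project-dir", "/app", "--profiles-dir", "/app"]
    )
    if not result.success:
        # Non-zero exit code lets the scheduler and monitoring see the failure.
        raise SystemExit(1)

if __name__ == "__main__":
    main()
```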
Facebook Research open sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering fastText embeddings are an extension of word2vec.
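Gensim implements both models behind near-identical APIs, which makes such a comparison easy to reproduce; this is a minimal sketch on a toy corpus (corpus and hyperparameters are illustrative), assuming gensim 4.x:

```python
# Minimal sketch: train both models on a toy corpus and probe the key
# difference, fastText's subword (character n-gram) vectors.
from gensim.models import FastText, Word2Vec

sentences = [
    ["fasttext", "learns", "subword", "embeddings"],
    ["word2vec", "learns", "whole", "word", "embeddings"],
]

w2v = Word2Vec(sentences, vector_size=50, min_count=1, epochs=20)
ft = FastText(sentences, vector_size=50, min_count=1, epochs=20)

# fastText composes a vector from character n-grams, so it can embed a
# word it never saw in training; word2vec has no entry for it at all.
print(ft.wv["subwords"][:5])   # out-of-vocabulary, still gets a vector
print("subwords" in w2v.wv)    # False
```

The subword n-grams are what make fastText an extension of word2vec: the training objective is the same, but each word vector is built from the vectors of its character n-grams.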
A. Jaiswal, S. Singh, and S. Tripathy. In 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), pages 1-6. IEEE, July 2023.