ghagerer | BibSonomy

bookmarks (58)

Intuitive Guide to Latent Dirichlet Allocation | by Thushan Ganegedara | Towards Data Science
Topic modelling refers to the task of identifying topics that best describe a set of documents. These topics will only emerge during the topic modelling process (therefore called latent). And one…
11 months ago by @ghagerer
tags: lda, topic-modeling, tutorial, unsupervised
Dynamic Few-Shot Prompting: Overcoming Context Limit for ChatGPT Text Classification | by Iryna Kondrashchenko | Jun, 2023 | Medium
Recent explosion in the popularity of large language models like ChatGPT has led to their increased usage in classical NLP tasks like language classification. This involves providing a context…
11 months ago by @ghagerer
tags: classification, few-shot, gpt3, llms, scikit-learn, zero-shot
Finally Remember Precision and Recall | by Jakub Rysavy | Towards Data Science
Can't remember what precision and recall (sensitivity) are? Why is accuracy not enough? Read the explanation with an example with a confusion matrix.
11 months ago by @ghagerer
tags: metrics, precision, recall
Perplexity in Language Models. Evaluating language models using the… | by Chiara Campagnola | Towards Data Science
Perplexity is a useful metric to evaluate models in Natural Language Processing (NLP). This article will cover the two ways in which it is normally defined and the intuitions behind them. A language…
12 months ago by @ghagerer
tags: entropy, llms, metrics, perplexity
All the Hard Stuff Nobody Talks About when Building Products with LLMs | Honeycomb
In this post, Phillip talks through the challenges & pitfalls of LLMs we faced when building our Query Assistant - and that you too may face.
12 months ago by @ghagerer
tags: deployment, llms, prompt-engineering
Chatbot Analytics: 9 Key Metrics You Must Track in 2023
The ultimate guide to chatbot analytics. Find out what bot metrics and KPIs you should measure and discover easy ways to optimize your chatbot performance.
a year ago by @ghagerer
tags: Chatgpt, evaluation, kpis, metrics
Measuring chatbot effectiveness - Visiativ Chatbot Solutions
These measurements are indispensable for tracking the results of your chatbot, identifying any stumbling blocks and continuously improving its performance. But which metrics should you choose?
a year ago by @ghagerer
tags: ChatGPT, chatbots, evaluation, kpis
Google "We Have No Moat, And Neither Does OpenAI"
We’ve done a lot of looking over our shoulders at OpenAI. Who will cross the next milestone? What will the next move be? But the uncomfortable truth is, we aren’t positioned to win this arms race and neither is OpenAI. While we’ve been squabbling, a third faction has been quietly eating our lunch. I’m talking, of course, about open source. Plainly put, they are lapping us. Things we consider “major open problems” are solved and in people’s hands today.
a year ago by @ghagerer
tags: LLMs, google, open-source, openai
GitHub - gventuri/pandas-ai
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
a year ago by @ghagerer
tags: Chatgpt, LLMs, module, pandas, python
Building a Document-based Question Answering System with LangChain, Pinecone, and LLMs like GPT-4 and ChatGPT
Build document-based question-answering systems using LangChain, Pinecone, LLMs like GPT-4, and semantic search for precise, context-aware AI solutions.
a year ago by @ghagerer
tags: chatbots, langchain, llms, question-answering
What meaning does the length of a Word2vec vector have? - Stack Overflow
When a word appears in different contexts, its vector gets moved in different directions during updates. The final vector then represents some sort of weighted average over the various contexts. Averaging over vectors that point in different directions typically results in a vector that gets shorter with increasing number of different contexts in which the word appears. For words to be used in many different contexts, they must carry little meaning. Prime examples of such insignificant words are high-frequency stop words, which are indeed represented by short vectors despite their high term frequencies ...
4 years ago by @ghagerer
tags: word-vector-length, word-vectors, word2vec
Should I normalize word2vec's word vectors before using them? - Cross Validated
When the downstream applications only care about the direction of the word vectors (e.g. they only pay attention to the cosine similarity of two words), then normalize, and forget about length. However, if the downstream applications are able to (or need to) consider more sensible aspects, such as word significance, or consistency in word usage (see below), then normalization might not be such a good idea.
4 years ago by @ghagerer
tags: normalization, word-vectors, word2vec
Movie Review Data -- SentiWordNet
This page is a distribution site for movie-review data for use in sentiment-analysis experiments. Available are collections of movie-review documents labeled with respect to their overall sentiment polarity (positive or negative) or subjective rating (e.g., "two and a half stars") and sentences labeled with respect to their subjectivity status (subjective or objective) or polarity. These data sets were introduced in the following papers:
4 years ago by @ghagerer
tags: dictionary-based, sentiment-analysis, sentiwordnet
Learning from Imperfect Annotations: An End-to-End Approach | OpenReview
https://openreview.net/forum?id=rJlVdREKDS
4 years ago by @ghagerer
tags: computer-vision, crowdsourcing, end-to-end, openreview, annotation-bias
MACE - Multi-Annotator Competence Estimation
MACE (Multi-Annotator Competence Estimation) is an implementation of an item-response model that lets you evaluate redundant annotations of categorical data. It provides competence estimates of the individual annotators and the most likely answer to each item. If we have 10 annotators answer a question, and five answer with 'yes' and five with 'no' (a surprisingly frequent event), we would normally have to flip a coin to decide what the right answer is. If we knew, however, that one of the people who answered 'yes' is an expert on the question, while one of the others just always selects 'no', we would take this information into account to weight their answers. MACE does exactly that. It tries to find out which annotators are more trustworthy and upweights their answers. All you need to provide is a CSV file with one item per line. In tests, MACE's trust estimates correlated highly with the annotators' true competence, and it achieved accuracies of over 0.9 on several test sets. MACE can take annotated items into account, if they are available. This helps to guide the training and improves accuracy.
4 years ago by @ghagerer
tags: crowdsourcing, inter-rater-agreement, annotation-bias
Contextual Topic Identification - Insight Data Science
Identifying meaningful topics for sparse Steam reviews -- by Steve Shao
4 years ago by @ghagerer
tags: auto-encoder, bert, clustering, topic-modeling, transfer-learning, umap
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction -- documentation
Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualisation similarly to t-SNE, but also for general non-linear dimension reduction. The algorithm is founded on three assumptions about the data: (1) the data is uniformly distributed on a Riemannian manifold; (2) the Riemannian metric is locally constant (or can be approximated as such); (3) the manifold is locally connected. From these assumptions it is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low dimensional projection of the data that has the closest possible equivalent fuzzy topological structure.
4 years ago by @ghagerer
tags: dimension-reduction, downprojection, exploratory-data-analysis
Exploratory Data Analysis Using D-Tale
D-Tale is an interactive web-based library that consists of a Flask backend and a React front-end serving as an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently, this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.
4 years ago by @ghagerer
tags: data-science, development, pandas, python, visualization
Gaussian Mixture Model clustering: how to select the number of components (clusters)
You want to discern how many clusters you have (or, if you prefer, how many Gaussian components generated the data), and you don’t have information about the “ground truth”. A real case, where the data do not behave as nicely as simulated ones.
4 years ago by @ghagerer
tags: clustering, gmms, optimal-k
Topic Coherence To Evaluate Topic Models
Definition of NLP coherence scores, in particular the intrinsic UMass measure and PMI. Human judgment not being correlated to perplexity (or likelihood of unseen documents) is the motivation for more work trying to model the human judgment. This is by itself a hard task as human judgment is not clearly defined; for example, two experts can disagree on the usefulness of a topic. One can classify the methods addressing this problem into two categories: intrinsic methods, which do not use any external source or task from the dataset, and extrinsic methods, which use the discovered topics for external tasks, such as information retrieval [Wei06], or use external statistics to evaluate topics.
4 years ago by @ghagerer
tags: coherence, coherence-score

publications (148)

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
Q. Wu, G. Bansal, J. Zhang, Y. Wu, B. Li, E. Zhu, L. Jiang, X. Zhang, S. Zhang, J. Liu and 4 other author(s). (2023)
14 days ago by @ghagerer
tags: agents, llms, multi-agent
On the portability of extractive Question-Answering systems on scientific papers to real-life application scenarios
C. Tahri, X. Tannier, and P. Haouat. Proceedings of the first Workshop on Information Extraction from Scientific Publications, pages 67-77. Online, Association for Computational Linguistics, (November 2022)
15 days ago by @ghagerer
tags: extractive, question-answering, retriever
Automatic Evaluation of Attribution by Large Language Models
X. Yue, B. Wang, Z. Chen, K. Zhang, Y. Su, and H. Sun. (2023)
16 days ago by @ghagerer
tags: attribution, fact-checking, hallucinations, llms, metrics
PDFTriage: Question Answering over Long, Structured Documents
J. Saad-Falcon, J. Barrow, A. Siu, A. Nenkova, D. Yoon, R. Rossi, and F. Dernoncourt. (November 2023)
a month ago by @ghagerer
tags: llm-agents, llms, question-answering, rag, structure-based, tool-llms
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
M. Suzgun and A. Kalai. (2024)
4 months ago by @ghagerer
tags: llms, meta-prompting, prompt-engineering
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog.
N. Jaques, A. Ghandeharioun, J. Shen, C. Ferguson, À. Lapedriza, N. Jones, S. Gu, and R. Picard. CoRR, (2019)
5 months ago by @ghagerer
tags: codefreeze, llms, reinforcement-learning
Improving language understanding by generative pre-training
A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever. (2018)
5 months ago by @ghagerer
tags: chatgpt, codefreeze, llms, openai, reinforcement-learning
Fine-Tuning Language Models from Human Preferences.
D. Ziegler, N. Stiennon, J. Wu, T. Brown, A. Radford, D. Amodei, P. Christiano, and G. Irving. CoRR, (2019)
5 months ago by @ghagerer
tags: ChatGPT, OpenAI, codefreeze, llms, reinforcement-learning
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
H. Zheng, S. Mishra, X. Chen, H. Cheng, E. Chi, Q. Le, and D. Zhou. (2023)
6 months ago by @ghagerer
tags: Google, deep-mind, llms, prompt-engineering, step-back-prompting
Understanding and Mitigating Technology-Facilitated Privacy Violations in the Physical World
M. Windl, V. Winterhalter, A. Schmidt, and S. Mayer. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, New York, NY, USA, Association for Computing Machinery, (2023)
6 months ago by @ghagerer
tags: privacy
A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases
J. Sequeda, D. Allemang, and B. Jacob. (2023)
6 months ago by @ghagerer
tags: SQL, databases, knowledge-graphs, llms, query-generation
Unifying Large Language Models and Knowledge Graphs: A Roadmap
S. Pan, L. Luo, Y. Wang, C. Chen, J. Wang, and X. Wu. (2023)
6 months ago by @ghagerer
tags: knowledge-graphs, llms
Learning to summarize from human feedback.
N. Stiennon, L. Ouyang, J. Wu, D. Ziegler, R. Lowe, C. Voss, A. Radford, D. Amodei, and P. Christiano. CoRR, (2020)
6 months ago by @ghagerer
tags: ChatGPT, OpenAI, abstractive, llms, reinforcement-learning, summarization
Beyond Bag-of-Concepts: Vectors of Locally Aggregated Concepts.
M. Grootendorst and J. Vanschoren. ECML/PKDD (2), volume 11907 of Lecture Notes in Computer Science, pages 681-696. Springer, (2019)
10 months ago by @ghagerer
tags: bag-of-concepts, clustering, document-embeddings, topic-modeling, unsupervised
Artificial intelligence for topic modelling in Hindu philosophy: Mapping themes between the Upanishads and the Bhagavad Gita
R. Chandra and M. Ranjan. PLOS ONE, 17 (9): 1-34 (September 2022)
11 months ago by @ghagerer
tags: descriptive, hindu-texts, topic-modeling
Deep Neural Networks for Rank-Consistent Ordinal Regression Based On Conditional Probabilities
X. Shi, W. Cao, and S. Raschka. CoRR, (2021)
11 months ago by @ghagerer
tags: neural-networks, ordinal, rank-based
ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
L. Rauch, M. Aßenmacher, D. Huseljic, M. Wirth, B. Bischl, and B. Sick. (June 2023)
11 months ago by @ghagerer
tags: active-learning, benchmarks, dataset, pre-trained
Scaling Laws for Neural Language Models.
J. Kaplan, S. McCandlish, T. Henighan, T. Brown, B. Chess, R. Child, S. Gray, A. Radford, J. Wu, and D. Amodei. CoRR, (2020)
12 months ago by @ghagerer
tags: llms, lstm, transformer
GLTR: Statistical Detection and Visualization of Generated Text.
S. Gehrmann, H. Strobelt, and A. Rush. CoRR, (2019)
12 months ago by @ghagerer
tags: demonstration, evaluation, llms, visualization
LoRA: Low-Rank Adaptation of Large Language Models
E. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, and W. Chen. (2021). arXiv:2106.09685. Comment: Draft V2 includes better baselines, experiments on GLUE, and more on adapter latency.
a year ago by @ghagerer
tags: fine-tuning, llms, transfer-learning, transformer


Dr. Gerhard Johann Hagerer
