In mathematics, the Wasserstein or Kantorovich–Rubinstein metric or distance is a distance function defined between probability distributions on a given metric space M.
Intuitively, if each distribution is viewed as a unit amount of "dirt" piled on M, the metric is the minimum "cost" of turning one pile into the other, which is assumed to be the amount of dirt that needs to be moved times the mean distance it has to be moved. Because of this analogy, the metric is known in computer science as the earth mover's distance.
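As a quick illustration, SciPy's `wasserstein_distance` computes the one-dimensional case from samples; shifting a distribution by a constant moves every unit of "dirt" exactly that far (the sample values here are arbitrary):

```python
from scipy.stats import wasserstein_distance

# empirical samples from two distributions; v is u shifted by 5,
# so every unit of "dirt" must travel a distance of exactly 5
u = [0.0, 1.0, 3.0]
v = [5.0, 6.0, 8.0]
d = wasserstein_distance(u, v)
print(d)  # 5.0
```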
We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
In natural language understanding (NLU) tasks, there is a hierarchy of lenses through which we can extract meaning — from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics. The process of learning, recognizing, and extracting these topics across a collection of documents is called topic modeling.
In this post, we will explore topic modeling through 4 of the most popular techniques today: LSA, pLSA, LDA, and the newer, deep learning-based lda2vec.
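As a taste of the third of these techniques, here is a minimal LDA sketch using scikit-learn (the toy corpus and parameter choices are illustrative, not from the post):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "the cat sat on the mat",
    "dogs and cats make friendly pets",
    "stocks fell as markets tumbled",
    "investors bought stocks and bonds",
]
X = CountVectorizer(stop_words="english").fit_transform(docs)

# LDA models each document as a mixture over n_components latent topics
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)
doc_topics = lda.transform(X)  # rows are per-document topic mixtures
```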
You want to discern how many clusters there are (or, if you prefer, how many Gaussian components generated the data), and you don't have any information about the "ground truth". A real case, where the data do not have the nicety of behaving as well as simulated data does.
Definition of NLP coherence scores, in particular intrinsic UMass measure and PMI.
Human judgment not being correlated to perplexity (or likelihood of unseen documents) is the motivation for more work trying to model the human judgment. This is by itself a hard task as human judgment is not clearly defined; for example, two experts can disagree on the usefulness of a topic.
One can classify the methods addressing this problem into two categories: intrinsic methods, which do not use any external source or task beyond the dataset, and extrinsic methods, which use the discovered topics for external tasks, such as information retrieval [Wei06], or use external statistics to evaluate topics.
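To make the intrinsic UMass measure concrete, here is a toy computation over a handful of documents (the corpus and topic words are made up for illustration). It scores a topic by summing log((D(w_i, w_j) + 1) / D(w_j)) over ordered pairs of the topic's top words, where D counts in how many documents the words (co-)occur:

```python
from math import log

# toy corpus: each document is a set of words
docs = [{"cat", "dog", "pet"}, {"dog", "bone"}, {"cat", "milk"}, {"dog", "cat"}]
topic = ["dog", "cat", "pet"]  # top words of one topic, most frequent first

def D(*words):
    """Number of documents containing all the given words."""
    return sum(all(w in doc for w in words) for doc in docs)

# UMass coherence: for each word, compare it against every earlier word
score = sum(
    log((D(wi, wj) + 1) / D(wj))
    for i, wi in enumerate(topic)
    for wj in topic[:i]
)
# higher (closer to 0) means the top words co-occur more, i.e. more coherent
```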
Facebook Research open sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering fastText embeddings are an extension of word2vec.
The main aim of SenticNet is to make the conceptual and affective information conveyed by natural language (meant for human consumption) more easily accessible to machines.
Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualisation similarly to t-SNE, but also for general non-linear dimension reduction. The algorithm is founded on three assumptions about the data:
The data is uniformly distributed on a Riemannian manifold;
The Riemannian metric is locally constant (or can be approximated as such);
The manifold is locally connected.
From these assumptions it is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low dimensional projection of the data that has the closest possible equivalent fuzzy topological structure.
MACE (Multi-Annotator Competence Estimation) is an implementation of an item-response model that lets you evaluate redundant annotations of categorical data. It provides competence estimates of the individual annotators and the most likely answer to each item.
If we have 10 annotators answer a question, and five answer with 'yes' and five with 'no' (a surprisingly frequent event), we would normally have to flip a coin to decide what the right answer is. If we knew, however, that one of the people who answered 'yes' is an expert on the question, while one of the others just always selects 'no', we would take this information into account to weight their answers. MACE does exactly that. It tries to find out which annotators are more trustworthy and upweighs their answers. All you need to provide is a CSV file with one item per line.
In tests, MACE's trust estimates correlated highly with the annotators' true competence, and it achieved accuracies of over 0.9 on several test sets. MACE can take annotated items into account, if they are available. This helps to guide the training and improves accuracy.
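The intuition behind this weighting can be shown with a toy example (pure Python, not MACE's actual item-response model): give each annotator a competence weight and take the weighted vote instead of the raw majority.

```python
# 10 annotators split 5-5; annotator 0 is an expert, annotator 9 a spammer
answers    = ["yes"] * 5 + ["no"] * 5
competence = [0.95, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.05]  # hypothetical

# weighted vote: each answer counts in proportion to annotator competence
score = {"yes": 0.0, "no": 0.0}
for answer, weight in zip(answers, competence):
    score[answer] += weight

best = max(score, key=score.get)
print(best)  # "yes" wins 2.95 to 2.05
```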
Build document-based question-answering systems using LangChain, Pinecone, LLMs like GPT-4, and semantic search for precise, context-aware AI solutions.
In this article, I am going to show you how to choose the number of principal components when using principal component analysis for dimensionality reduction.
In the first section, I am going to give you a short answer for those of you who are in a hurry and want to get something working. Later, I am going to provide a more extended explanation for those of you who are interested in understanding PCA.
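For the impatient, the short answer usually boils down to the cumulative explained-variance ratio; here is a minimal scikit-learn sketch (the random data and the 90% threshold are just placeholders):

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))

pca = PCA().fit(X)
cumulative = np.cumsum(pca.explained_variance_ratio_)

# smallest k whose first k components explain at least 90% of the variance
k = int(np.searchsorted(cumulative, 0.90) + 1)
```

Note that scikit-learn can also do this directly: passing a float in (0, 1) as `n_components`, e.g. `PCA(n_components=0.90)`, keeps just enough components to reach that variance threshold.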
As data engineers, you might have heard the terms functional data pipeline, factory pattern, singleton pattern, etc. One can quickly look up the implementation, but it can be tricky to understand what they are precisely and when to (& when not to) use them. Blindly following a pattern can help in some cases, but not knowing the caveats of a design will lead to hard-to-maintain and brittle code! While writing clean and easy-to-read code takes years of experience, you can accelerate that by understanding the nuances and reasoning behind each pattern.

Imagine being able to design an implementation that provides the best extensibility and maintainability! Your colleagues (& future self) will be extremely grateful, your feature delivery speed will increase, and your boss will highly value your opinion.

In this post, we will go over the specific code design patterns used for data pipelines, when and why to use them, and when not to, and we will also go over a few Python-specific techniques to help you write better pipelines. By the end of this post, you will be able to identify patterns in your data pipelines and apply the appropriate code design patterns. You will also be able to take advantage of Pythonic features to write bug-free, maintainable code that is a joy to work on!
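As a flavor of what such patterns look like, here is a minimal registry-based factory for pipeline steps (the step names and transformations are invented for illustration):

```python
from typing import Callable, Dict, List

# the factory's lookup table: step name -> step implementation
STEPS: Dict[str, Callable[[List[str]], List[str]]] = {}

def register(name: str):
    """Decorator that registers a pipeline step under a config-friendly name."""
    def decorator(fn):
        STEPS[name] = fn
        return fn
    return decorator

@register("dedupe")
def dedupe(rows):
    return list(dict.fromkeys(rows))  # drop duplicates, preserve order

@register("upper")
def upper(rows):
    return [row.upper() for row in rows]

def run_pipeline(rows, step_names):
    """Build the pipeline from step names (e.g. loaded from config) and run it."""
    for name in step_names:
        rows = STEPS[name](rows)
    return rows

result = run_pipeline(["a", "a", "b"], ["dedupe", "upper"])
print(result)  # ['A', 'B']
```

The benefit is that the pipeline's shape lives in data (the list of names) rather than in code, so adding a step means registering one function instead of editing the orchestration logic.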
The pulearn Python package provides a collection of scikit-learn wrappers to several positive-unlabeled learning (PU-learning) methods.
Features
Scikit-learn compliant wrappers to prominent PU-learning methods.
Fully tested on Linux, macOS and Windows systems.
Compatible with Python 3.5+.
The recent release of this open-source project, LlamaFS, addresses the challenges associated with traditional file management systems, particularly in the context of overstuffed download folders, inefficient file organization, and the limitations of knowledge-based organization. These issues arise due to the manual nature of file sorting, which often leads to inconsistent structures and difficulty finding specific files. The disorganization in the file system hampers productivity and makes it challenging to locate important files quickly.
D-Tale is an interactive web-based library that consists of a Flask backend and a React front-end serving as an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently, this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.
When a word appears in different contexts, its vector gets moved in different directions during updates. The final vector then represents some sort of weighted average over the various contexts. Averaging over vectors that point in different directions typically results in a vector that gets shorter with increasing number of different contexts in which the word appears. For words to be used in many different contexts, they must carry little meaning. Prime examples of such insignificant words are high-frequency stop words, which are indeed represented by short vectors despite their high term frequencies ...
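The shortening effect is easy to reproduce numerically; in this small NumPy sketch, random unit vectors stand in for the update directions of a word's different contexts:

```python
import numpy as np

rng = np.random.default_rng(0)

def mean_vector_length(n_contexts, dim=50):
    # one random unit vector per context, then average them
    vs = rng.normal(size=(n_contexts, dim))
    vs /= np.linalg.norm(vs, axis=1, keepdims=True)
    return np.linalg.norm(vs.mean(axis=0))

few = mean_vector_length(2)     # word seen in only a few contexts
many = mean_vector_length(200)  # stop-word-like: seen in many contexts
# averaging over many unrelated directions yields a much shorter vector
```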
This page is a distribution site for movie-review data for use in sentiment-analysis experiments. Available are collections of movie-review documents labeled with respect to their overall sentiment polarity (positive or negative) or subjective rating (e.g., "two and a half stars") and sentences labeled with respect to their subjectivity status (subjective or objective) or polarity. These data sets were introduced in the following papers:
In this article, we will explore how we can use Llama2 for Topic Modeling without the need to pass every single document to the model. Instead, we will leverage BERTopic, a modular topic modeling technique that can use any LLM for fine-tuning topic representations.
I recently created a demo for some prospective clients of mine, demonstrating how to use Large Language Models (LLMs) together with graph databases like Neo4J.
The two have a lot of interesting interactions, namely that you can now create knowledge graphs more easily than ever before, by having AI find the graph entities and relationships from your unstructured data, rather than having to do all that manually.
On top of that, graph databases also have some advantages for Retrieval Augmented Generation (RAG) applications compared to vector search, which is currently the prevailing approach to RAG.
These measurements are indispensable for tracking the results of your chatbot, identifying any stumbling blocks and continuously improving its performance. But which metrics should you choose?
Perplexity is a useful metric to evaluate models in Natural Language Processing (NLP). This article will cover the two ways in which it is normally defined and the intuitions behind them. A language…
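The two definitions, the inverse geometric mean of the token probabilities and the exponentiated cross-entropy, agree, which a few lines of Python make explicit (the probabilities here are made up):

```python
import math

# probabilities a hypothetical language model assigns to each token
probs = [0.25, 0.1, 0.5, 0.2]
n = len(probs)

# definition 1: inverse of the geometric mean of the token probabilities
ppl_geometric = math.prod(probs) ** (-1 / n)

# definition 2: exponentiated average negative log-likelihood (cross-entropy)
ppl_entropy = math.exp(-sum(math.log(p) for p in probs) / n)

assert math.isclose(ppl_geometric, ppl_entropy)
```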
Recent explosion in the popularity of large language models like ChatGPT has led to their increased usage in classical NLP tasks like language classification. This involves providing a context…
We’ve done a lot of looking over our shoulders at OpenAI. Who will cross the next milestone? What will the next move be?
But the uncomfortable truth is, we aren’t positioned to win this arms race and neither is OpenAI. While we’ve been squabbling, a third faction has been quietly eating our lunch.
I’m talking, of course, about open source. Plainly put, they are lapping us. Things we consider “major open problems” are solved and in people’s hands today.
In today’s rapidly evolving tech landscape, the integration of advanced language models with robust data management systems is opening new horizons for data processing and analytics. One of the most…
Since 2021 we’ve been releasing the annual State of Data Engineering Report, a compilation of all the relevant categories that have a direct impact on data engineering infrastructure.
In 2024, we see 3 primary trends that influence the categories which will be covered in this report.
Comparing machine learning methods and selecting a final model is a common operation in applied machine learning.
Models are commonly evaluated using resampling methods like k-fold cross-validation from which mean skill scores are calculated and compared directly. Although simple, this approach can be misleading as it is hard to know whether the difference between mean skill scores is real or the result of a statistical fluke.
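A common (if imperfect) first step beyond comparing means is a paired test on the per-fold scores; here is a sketch with SciPy (the fold scores are invented):

```python
import numpy as np
from scipy.stats import ttest_rel

# hypothetical accuracies from the same 10 folds for two models
model_a = np.array([0.81, 0.79, 0.84, 0.80, 0.83, 0.78, 0.82, 0.80, 0.85, 0.79])
model_b = np.array([0.79, 0.78, 0.81, 0.80, 0.80, 0.77, 0.80, 0.79, 0.82, 0.78])

# paired, because both models were scored on identical folds
t_stat, p_value = ttest_rel(model_a, model_b)
# a small p-value suggests the difference is unlikely to be a fluke
```

A caveat: cross-validation folds share training data, so this naive test tends to be optimistic; corrected variants such as the 5x2cv paired t-test are often recommended instead.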
Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text.
LSA is an information retrieval technique which analyzes and identifies patterns in an unstructured collection of texts and the relationships between them.
LSA itself is an unsupervised way of uncovering synonyms in a collection of documents.
To start, we take a look at how Latent Semantic Analysis is used in Natural Language Processing to analyze relationships between a set of documents and the terms that they contain. Then we go a step further to analyze and classify sentiment. We will review Chi Squared for feature selection along the way.
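In scikit-learn terms, the LSA step is just a truncated SVD of a TF-IDF matrix; a minimal sketch (toy corpus, parameters illustrative):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

docs = [
    "the cat sat on the mat",
    "a dog chased the cat",
    "stock prices fell sharply",
    "markets and stocks rallied",
]

X = TfidfVectorizer().fit_transform(docs)

# LSA: project documents onto a low-rank latent semantic space
lsa = TruncatedSVD(n_components=2, random_state=0)
doc_vectors = lsa.fit_transform(X)  # shape: (n_docs, 2)
```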
In probability theory and statistics, the Jensen–Shannon divergence is a method of measuring the similarity between two probability distributions. It is also known as information radius (IRad)[1] or total divergence to the average.[2] It is based on the Kullback–Leibler divergence, with some notable (and useful) differences, including that it is symmetric and it always has a finite value. The square root of the Jensen–Shannon divergence is a metric often referred to as Jensen-Shannon distance.[3][4][5]
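Concretely, with M = (P + Q)/2 the divergence is JSD(P‖Q) = ½ KL(P‖M) + ½ KL(Q‖M); a NumPy computation checked against SciPy's built-in distance (the two distributions are arbitrary):

```python
import numpy as np
from scipy.spatial.distance import jensenshannon

def kl(p, q):
    """Kullback-Leibler divergence (natural log), skipping zero-probability terms."""
    mask = p > 0
    return np.sum(p[mask] * np.log(p[mask] / q[mask]))

p = np.array([0.4, 0.4, 0.2])
q = np.array([0.1, 0.5, 0.4])
m = 0.5 * (p + q)

jsd = 0.5 * kl(p, m) + 0.5 * kl(q, m)  # the divergence: symmetric and finite
distance = np.sqrt(jsd)                 # its square root is a metric

assert np.isclose(distance, jensenshannon(p, q))
```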
In the natural language processing (NLP) field, it is hard to augment text due to the high complexity of language. Not every word can be replaced by another, such as a, an, or the; also, not every word has a synonym. Even changing a single word can make the context totally different. On the other hand, generating augmented images in the computer vision area is relatively easier: even after introducing noise or cropping out a portion of an image, a model can still classify the image.
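One of the simplest text-augmentation strategies that does work is synonym replacement for words that have safe substitutes; a toy sketch (the synonym map is hand-made for illustration, not a real thesaurus):

```python
import random

SYNONYMS = {"quick": ["fast", "speedy"], "happy": ["glad", "joyful"]}

def augment(sentence, rng=random.Random(0)):
    """Replace words that have known synonyms; leave everything else untouched."""
    return " ".join(
        rng.choice(SYNONYMS[word]) if word in SYNONYMS else word
        for word in sentence.split()
    )

augmented = augment("the quick dog looks happy")
```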
There are currently few datasets appropriate for training and evaluating models for non-goal-oriented dialogue systems (chatbots); and equally problematic, there is currently no standard procedure for evaluating such models beyond the classic Turing test.
The aim of our competition is therefore to establish a concrete scenario for testing chatbots that aim to engage humans, and become a standard evaluation tool in order to make such systems directly comparable.
I'm not sure what these vectors are, since BERT does not generate meaningful sentence vectors. It seems that this is doing average pooling over the word tokens to get a sentence vector, but we never suggested that this will generate meaningful sentence representations. And even if they are decent representations when fed into a DNN trained for a downstream task, it doesn't mean that they will be meaningful in terms of cosine distance. (Since cosine distance is a linear space where all dimensions are weighted equally.)
In this article, we’ll look at Weakly Supervised Learning (WSL), which provides a solution by leveraging “weak” annotations to learn the task. But before we dive deeper into the techniques, it is worth exploring the various types of WSL techniques and the sections we intend to cover in this article.
In recent years, the real-world impact of machine learning (ML) has grown in leaps and bounds. In large part, this is due to the advent of deep learning models, which allow practitioners to get state-of-the-art scores on benchmark datasets without any hand-engineered features. Given the availability of multiple open-source ML frameworks like TensorFlow and PyTorch, and an abundance of available state-of-the-art models, it can be argued that high-quality ML models are almost a commoditized resource now. There is a hidden catch, however: the reliance of these models on massive sets of hand-labeled training data.
These hand-labeled training sets are expensive and time-consuming to create — often requiring person-months or years to assemble, clean, and debug — especially when domain expertise is required. On top of this, tasks often change and evolve in the real world. For example, labeling guidelines, granularities, or downstream use cases often change, necessitating re-labeling (e.g., instead of classifying reviews only as positive or negative, introducing a neutral category). For all these reasons, practitioners have increasingly been turning to weaker forms of supervision, such as heuristically generating training data with external knowledge bases, patterns/rules, or other classifiers. Essentially, these are all ways of programmatically generating training data—or, more succinctly, programming training data.
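The "programming training data" idea can be sketched in a few lines of plain Python (the heuristics and labels are invented for illustration; real systems such as Snorkel additionally model the accuracy of, and correlations between, these labeling functions rather than taking a raw vote):

```python
from collections import Counter

POS, NEG, ABSTAIN = 1, 0, -1

# each labeling function encodes one noisy heuristic
def lf_great(text):    return POS if "great" in text else ABSTAIN
def lf_terrible(text): return NEG if "terrible" in text else ABSTAIN
def lf_refund(text):   return NEG if "refund" in text else ABSTAIN

LABELING_FUNCTIONS = [lf_great, lf_terrible, lf_refund]

def weak_label(text):
    """Majority vote over the labeling functions that did not abstain."""
    votes = [v for lf in LABELING_FUNCTIONS if (v := lf(text)) != ABSTAIN]
    return Counter(votes).most_common(1)[0][0] if votes else ABSTAIN

label_1 = weak_label("great phone, would buy again")     # POS
label_2 = weak_label("terrible battery, want a refund")  # NEG
```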
We begin by reviewing areas of ML that are motivated by the problem of labeling training data, and then describe our research on modeling and integrating a diverse set of supervision sources. We also discuss our vision for building data management systems for the massively multi-task regime with tens or hundreds of weakly supervised dynamic tasks interacting in complex and varied ways. Check out our research blog for detailed discussions of these topics and more!
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational.
The ultimate guide to chatbot analytics. Find out what bot metrics and KPIs you should measure and discover easy ways to optimize your chatbot performance.
Large language models (LLMs) have proven to be valuable tools, but they often lack reliability. Many instances have surfaced where LLM-generated responses included false information. Specifically…
S. Basu, A. Banerjee, and R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, pp. 333–344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)
C. Baziotis, N. Pelekis, and C. Doulkeridis. Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 390–395. Association for Computational Linguistics, (2017)