ghagerer | BibSonomy

bookmarks (hide)64
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1About « SenticNet
The main aim of SenticNet is to make the conceptual and affective information conveyed by natural language (meant for human consumption) more easily-accessible to machines.
4 years ago by @ghagerer
show all tags
senticnet
sentics
sentiment-analysis
senticnetsenticssentiment-analysis
copydelete
- community post
- history of this post
1Advanced RAG with Knowledge Graphs (Neo4J demo)
I recently created a demo for some prospective clients of mine, demonstrating how to use Large Language Models (LLMs) together with graph databases like Neo4J. The two have a lot of interesting interactions, namely that you can now create knowledge graphs easier than ever before, by having AI find the graph entities and relationships from your unstructured data, rather than having to do all that manually. On top of that, graph databases also have some advantages for Retrieval Augmented Generation (RAG) applications compared to vector search, which is currently the prevailing approach to RAG.
9 months ago by @ghagerer
show all tags
youtube
rag
llms
knowledge-graphs
youtuberagllmsknowledge-graphs
copydelete
- community post
- history of this post
1AI Trained on Old Scientific Papers Makes Discoveries Humans Missed - VICE
Scientists used machine learning to reveal new scientific knowledge hidden in old research papers.
5 years ago by @ghagerer
show all tags
scientific-papers
vice
word-vectors
word2vec
scientific-papersviceword-vectorsword2vec
copydelete
- community post
- history of this post
1All the Hard Stuff Nobody Talks About when Building Products with LLMs | Honeycomb
In this post, Phillip talks through the challenges & pitfalls of LLMs we faced when building our Query Assistant - and that you too may face.
a year ago by @ghagerer
show all tags
prompt-engineering
llms
deployment
prompt-engineeringllmsdeployment
copydelete
- community post
- history of this post
1BERT for unsupervised text tasks - ETHER Labs - Medium
Document embeddings and sentence relatedness using BERT
5 years ago by @ghagerer
show all tags
sentence-embeddings
bert
sentence-relatedness
document-embeddings
sentence-embeddingsbertsentence-relatednessdocument-embeddings
copydelete
- community post
- history of this post
1BERT Vector Space shows issues with unknown words · Issue #164 · google-research/bert · GitHub
I'm not sure what these vectors are, since BERT does not generate meaningful sentence vectors. It seems that this is is doing average pooling over the word tokens to get a sentence vector, but we never suggested that this will generate meaningful sentence representations. And even if they are decent representations when fed into a DNN trained for a downstream task, it doesn't mean that they will be meaningful in terms of cosine distance. (Since cosine distance is a linear space where all dimensions are weighted equally).
5 years ago by @ghagerer
show all tags
cls
sentence-embeddings
bert
clssentence-embeddingsbert
copydelete
- community post
- history of this post
1Bhattacharyya distance - Wikipedia
In statistics, the Bhattacharyya distance measures the similarity of two probability distributions. It is closely related to the Bhattacharyya coefficient which is a measure of the amount of overlap between two statistical samples or populations. Both measures are named after Anil Kumar Bhattacharya, a statistician who worked in the 1930s at the Indian Statistical Institute.[1] The coefficient can be used to determine the relative closeness of the two samples being considered. It is used to measure the separability of classes in classification and it is considered to be more reliable than the Mahalanobis distance, as the Mahalanobis distance is a particular case of the Bhattacharyya distance when the standard deviations of the two classes are the same. Consequently, when two classes have similar means but different standard deviations, the Mahalanobis distance would tend to zero, whereas the Bhattacharyya distance grows depending on the difference between the standard deviations.
4 years ago by @ghagerer
show all tags
probability-distribution-similarity
probability-distribution-similarity
copydelete
- community post
- history of this post
1Building a Document-based Question Answering System with LangChain, Pinecone, and LLMs like GPT-4 and ChatGPT
Build document-based question-answering systems using LangChain, Pinecone, LLMs like GPT-4, and semantic search for precise, context-aware AI solutions.
a year ago by @ghagerer
show all tags
chatbots
llms
langchain
question-answering
chatbotsllmslangchainquestion-answering
copydelete
- community post
- history of this post
1Building a Private ChatGPT Interface With Azure OpenAI – Baldacchino Automation
https://automation.baldacchino.net/building-a-private-chatgpt-interface-with-azure-openai/
12 months ago by @ghagerer
show all tags
cloud
llms
gpt3
chatgpt
azure
cloudllmsgpt3chatgptazure
copydelete
- community post
- history of this post
1Chatbot Analytics: 9 Key Metrics You Must Track in 2023
The ultimate guide to chatbot analytics. Find out what bot metrics and KPIs you should measure and discover easy ways to optimize your chatbot performance.
a year ago by @ghagerer
show all tags
Chatgpt
evaluation
metrics
kpis
Chatgptevaluationmetricskpis
copydelete
- community post
- history of this post
1Contextual Topic Identification - Insight Data Science
Identifying meaningful topics for sparse Steam reviews -- by Steve Shao
4 years ago by @ghagerer
show all tags
umap
auto-encoder
transfer-learning
clustering
bert
topic-modeling
umapauto-encodertransfer-learningclusteringberttopic-modeling
copydelete
- community post
- history of this post
1Dagster with User Code Deployments (gRPC) - DEV Community
If you haven't heard about Dagster Dagster is an open source data orchestrator for machine... Tagged with dataengineering, etl, dagster, kubernetes.
24 days ago by @ghagerer
show all tags
dagster
data-engineering
data-orchestration
dagsterdata-engineeringdata-orchestration
copydelete
- community post
- history of this post
1Data Augmentation in NLP - Towards Data Science
In natural language processing (NLP) field, it is hard to augmenting text due to high complexity of language. Not every word we can replace it by others such as a, an, the. Also, not every word has synonym. Even changing a word, the context will be totally difference. On the other hand, generating augmented image in computer vision area is relative easier. Even introducing noise or cropping out portion of image, model can still classify the image.
5 years ago by @ghagerer
show all tags
data-augmentation
data-augmentation
copydelete
- community post
- history of this post
1Data Pipeline Design Patterns - #2. Coding patterns in Python · Start Data Engineering
As data engineers, you might have heard the terms functional data pipeline, factory pattern, singleton pattern, etc. One can quickly look up the implementation, but it can be tricky to understand what they are precisely and when to (& when not to) use them. Blindly following a pattern can help in some cases, but not knowing the caveats of a design will lead to hard-to-maintain and brittle code! While writing clean and easy-to-read code takes years of experience, you can accelerate that by understanding the nuances and reasoning behind each pattern. Imagine being able to design an implementation that provides the best extensibility and maintainability! Your colleagues (& future self) will be extremely grateful, your feature delivery speed will increase, and your boss will highly value your opinion. In this post, we will go over the specific code design patterns used for data pipelines, when and why to use them, and when not to use them, and we will also go over a few python specific techniques to help you write better pipelines. By the end of this post, you will be able to identify patterns in your data pipelines and apply the appropriate code design patterns. You will also be able to take advantage of pythonic features to write bug-free, maintainable code that is a joy to work on!
15 days ago by @ghagerer
show all tags
data-engineering
software-engineering
design-patterns
data-engineeringsoftware-engineeringdesign-patterns
copydelete
- community post
- history of this post
1Databricks vs. Snowflake: An Honest Comparison in 2024
Jeff from Sync Computing and Ian from SELECT sit down for an hour to discuss Snowflake and Databricks.
22 days ago by @ghagerer
show all tags
snowflake
comparison
Databricks
datalakes
snowflakecomparisonDatabricksdatalakes
copydelete
- community post
- history of this post
1Dynamic Few-Shot Prompting: Overcoming Context Limit for ChatGPT Text Classification | by Iryna Kondrashchenko | Jun, 2023 | Medium
Recent explosion in the popularity of large language models like ChatGPT has led to their increased usage in classical NLP tasks like language classification. This involves providing a context…
a year ago by @ghagerer
show all tags
scikit-learn
few-shot
llms
gpt3
zero-shot
classification
scikit-learnfew-shotllmsgpt3zero-shotclassification
copydelete
- community post
- history of this post
1Effective Methods Against LLM Hallucination | by Hooman Sedghamiz | Aug, 2023 | Medium
Large language models (LLMs) have proven to be valuable tools, but they often lack reliability. Many instances have surfaced where LLM-generated responses included false information. Specifically…
10 months ago by @ghagerer
show all tags
llms
hallucinations
llmshallucinations
copydelete
- community post
- history of this post
1Efficient Extractive Question Answering on CPU using QUIP | by Zachariah Zhang | Medium
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
3 months ago by @ghagerer
show all tags
scoring
retriever
question-answering
extractive
scoringretrieverquestion-answeringextractive
copydelete
- community post
- history of this post
1Emerging Architectures for LLM Applications | Andreessen Horowitz
A reference architecture for the LLM app stack. It shows the most common systems, tools, and design patterns used by AI startups and tech companies.
a year ago by @ghagerer
show all tags
cloud
llms
langchain
architecture
cloudllmslangchainarchitecture
copydelete
- community post
- history of this post
1Exploratory Data Analysis Using D-Tale
D-Tale is an interactive web-based library that consists of a Flask backend and a React front-end serving as an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently, this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.
4 years ago by @ghagerer
show all tags
pandas
development
python
visualization
data-science
pandasdevelopmentpythonvisualizationdata-science
copydelete
- community post
- history of this post
1FastText and Gensim word embeddings | RARE Technologies
Facebook Research open sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering fastText embeddings are an extension of word2vec.
5 years ago by @ghagerer
show all tags
comparison
fasttext
word2vec
comparisonfasttextword2vec
copydelete
- community post
- history of this post
1Finally Remember Precision and Recall | by Jakub Rysavy | Towards Data Science
Can't remember what is precision and recall (sensitivity)? Why accuracy is not enough? Read the explanation with an example with a confusion matrix.
a year ago by @ghagerer
show all tags
precision
recall
metrics
precisionrecallmetrics
copydelete
- community post
- history of this post
1Fine Tuning Large Language Models on Azure Machine Learning | by Keshav Singh | Jul, 2023 | Dev Genius
Eversince Nov 2022, as Microsoft and OpenAI accounted ChatGTP the LLM space has been revolutionized and democratized. The demand to adopt the technology and apply it to the diverse use cases across…
a year ago by @ghagerer
show all tags
cloud
llms
huggingface
open-source
fine-tuning
azure
cloudllmshuggingfaceopen-sourcefine-tuningazure
copydelete
- community post
- history of this post
1Gaussian Mixture Model clustering: how to select the number of components (clusters)
You want to discern how many clusters we have (or, if you prefer, how many gaussians components generated the data), and you don’t have information about the “ground truth”. A real case, where data do not have the nicety of behaving good as the simulated ones.
4 years ago by @ghagerer
show all tags
optimal-k
clustering
gmms
optimal-kclusteringgmms
copydelete
- community post
- history of this post
1GitHub - gventuri/pandas-ai
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational - GitHub - gventuri/pandas-ai: Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
a year ago by @ghagerer
show all tags
module
pandas
Chatgpt
python
LLMs
modulepandasChatgptpythonLLMs
copydelete
- community post
- history of this post
1GitHub - huggingface/pytorch-transformers: A library of state-of-the-art pretrained models for Natural Language Processing (NLP)
https://github.com/huggingface/pytorch-transformers
5 years ago by @ghagerer
show all tags
library
pre-trained
librarypre-trained
copydelete
- community post
- history of this post
3Google "We Have No Moat, And Neither Does OpenAI"
We’ve done a lot of looking over our shoulders at OpenAI. Who will cross the next milestone? What will the next move be? But the uncomfortable truth is, we aren’t positioned to win this arms race and neither is OpenAI. While we’ve been squabbling, a third faction has been quietly eating our lunch. I’m talking, of course, about open source. Plainly put, they are lapping us. Things we consider “major open problems” are solved and in people’s hands today.
a year ago by @ghagerer
show all tags
openai
LLMs
open-source
google
openaiLLMsopen-sourcegoogle
copydelete
- community post
- history of this post
1How to build your own ChatGPT application with Streamlit and deploy it on Azure as Web Application. | by Andreas Hopfgartner | Jun, 2023 | Medium
In the ever-evolving world of technology, natural language processing (NLP) and artificial intelligence (AI) have been turning heads with their jaw-dropping advancements. One of the standout players…
a year ago by @ghagerer
show all tags
cloud
python
chatgpt
deployment
webdev
cloudpythonchatgptdeploymentwebdev
copydelete
- community post
- history of this post
1How to Use Inaccurate Data for Machine Learning with Weakly Supervised Learning | Lionbridge AI
In this article, we’ll look at Weakly Supervised Learning (WSL), which provides a solution by leveraging “weak” annotations to learn the task. But before we dive deeper into the techniques, it is worth exploring the various types of WSL techniques and the sections we intend to cover in this article.
4 years ago by @ghagerer
show all tags
noise-aware
weakly-supervised
noise-awareweakly-supervised
copydelete
- community post
- history of this post
1HuggingFace: OpenChat: Less is More for Open-source Models
OpenChat is a series of open-source language models fine-tuned on a diverse and high-quality dataset of multi-round conversations. With only ~6K GPT-4 conversations filtered from the ~90K ShareGPT conversations, OpenChat is designed to achieve high performance with limited data.
a year ago by @ghagerer
show all tags
HuggingFace
LLMs
open-source
Openchat
HuggingFaceLLMsopen-sourceOpenchat
copydelete
- community post
- history of this post
1Integrating Langchain with Snowflake Cortex | by Bart Wrobel | Snowflake Builders Blog: Data Engineers, App Developers, AI/ML, & Data Science | Medium
In today’s rapidly evolving tech landscape, the integration of advanced language models with robust data management systems is opening new horizons for data processing and analytics. One of the most…
2 months ago by @ghagerer
show all tags
snowflake
llms
langchain
snowflakellmslangchain
copydelete
- community post
- history of this post
1Intuitive Guide to Latent Dirichlet Allocation | by Thushan Ganegedara | Towards Data Science
Topic modelling refers to the task of identifying topics that best describes a set of documents. These topics will only emerge during the topic modelling process (therefore called latent). And one…
a year ago by @ghagerer
show all tags
unsupervised
tutorial
lda
topic-modeling
unsupervisedtutorialldatopic-modeling
copydelete
- community post
- history of this post
1Jensen–Shannon divergence - Wikipedia
In probability theory and statistics, the Jensen–Shannon divergence is a method of measuring the similarity between two probability distributions. It is also known as information radius (IRad)[1] or total divergence to the average.[2] It is based on the Kullback–Leibler divergence, with some notable (and useful) differences, including that it is symmetric and it always has a finite value. The square root of the Jensen–Shannon divergence is a metric often referred to as Jensen-Shannon distance.[3][4][5]
4 years ago by @ghagerer
show all tags
probability-distribution-similarity
probability-distribution-similarity
copydelete
- community post
- history of this post
1Latent Semantic Analysis & Sentiment Classification with Python
Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text. LSA is an information retrieval technique which analyzes and identifies the pattern in unstructured collection of text and the relationship between them. LSA itself is an unsupervised way of uncovering synonyms in a collection of documents. To start, we take a look how Latent Semantic Analysis is used in Natural Language Processing to analyze relationships between a set of documents and the terms that they contain. Then we go steps further to analyze and classify sentiment. We will review Chi Squared for feature selection along the way.
5 years ago by @ghagerer
show all tags
downprojection
chi-square
clustering
lsa
downprojectionchi-squareclusteringlsa
copydelete
- community post
- history of this post
1Learn Prompting: Your Guide to Communicating with AI
Learn Prompting is the largest and most comprehensive course in prompt engineering available on the internet, with over 60 content modules, translated into 9 languages, and a thriving community.
11 months ago by @ghagerer
show all tags
prompt-engineering
llms
tutorial
prompt-engineeringllmstutorial
copydelete
- community post
- history of this post
2Learning from Imperfect Annotations: An End-to-End Approach | OpenReview
https://openreview.net/forum?id=rJlVdREKDS
4 years ago by @ghagerer
show all tags
annotation-bias
computer-vision
crowdsourcing
end-to-end
openreview
annotation-biascomputer-visioncrowdsourcingend-to-endopenreview
copydelete
- community post
- history of this post
1LlamaFS: An Open-Source Self-Organizing File system with Llama-3
The recent release of this open-source project, LlamaFS, addresses the challenges associated with traditional file management systems, particularly in the context of overstuffed download folders, inefficient file organization, and the limitations of knowledge-based organization. These issues arise due to the manual nature of file sorting, which often leads to inconsistent structures and difficulty finding specific files. The disorganization in the file system hampers productivity and makes it challenging to locate important files quickly.
2 months ago by @ghagerer
show all tags
metadata
file-system
llamafs
llms
metadatafile-systemllamafsllms
copydelete
- community post
- history of this post
1LMQL is a programming language for LLM interaction. | LMQL
Language Model Query Language
10 months ago by @ghagerer
show all tags
python
llms
programming
pythonllmsprogramming
copydelete
- community post
- history of this post
1LMQL is a programming language for LLM interaction. | LMQL
This talk explores the integration of Knowledge Graphs (KGs) and Large Language Models (LLM) to harness their combined power for improved natural language understanding. By leveraging KGs' structured knowledge and language models' text comprehension abilities, we can leverage the domain-specific–and potentially sensitive–data together with the general knowledge of LLMs. We also examine how language models can enhance KGs through knowledge extraction and refinement. The integration of these technologies presents opportunities in various domains, from question-answering to chatbots, fostering more intelligent and context-aware applications.
9 months ago by @ghagerer
show all tags
youtube
rag
llms
knowledge-graphs
youtuberagllmsknowledge-graphs
copydelete
- community post
- history of this post
1MACE - Multi-Annotator Competence Estimation
MACE (Multi-Annotator Competence Estimation) is an implementation of an item-response model that let's you evaluate redundant annotations of categorical data. It provides competence estimates of the individual annotators and the most likely answer to each item. If we have 10 annotators answer a question, and five answer with 'yes' and five with 'no' (a surprisingly frequent event), we would normaly have to flip a coin to decide what the right answer is. If we knew, however, that one of the people who answered 'yes' is an expert on the question, while one of the others just alwas selects 'no', we would take this information into account to weight their answers. MACE does exactly that. It tries to find out which annotators are more trustworthy and upweighs their answers. All you need to provide is a CSV file with one item per line. In tests, MACE's trust estimates correlated highly wth the annotators' true competence, and it achieved accuracies of over 0.9 on several test sets. MACE can take annotated items into account, if they are available. This helps to guide the training and improves accuracy.
4 years ago by @ghagerer
show all tags
annotation-bias
inter-rater-agreement
crowdsourcing
annotation-biasinter-rater-agreementcrowdsourcing
copydelete
- community post
- history of this post
1 Measuring chatbot effectiveness - Visiativ Chatbot Solutions
These measurements are indispensable for tracking the results of your chatbot, identifying any stumbling blocks and continuously improving its performance. But which metrics should you choose?
a year ago by @ghagerer
show all tags
chatbots
ChatGPT
evaluation
kpis
chatbotsChatGPTevaluationkpis
copydelete
- community post
- history of this post
1Module pulearn API documentation
The pulearn Python package provide a collection of scikit-learn wrappers to several positive-unlabled learning (PU-learning) methods. Features Scikit-learn compliant wrappers to prominent PU-learning methods. Fully tested on Linux, macOS and Windows systems. Compatible with Python 3.5+.
a year ago by @ghagerer
show all tags
python
open-source
pu-learning
pythonopen-sourcepu-learning
copydelete
- community post
- history of this post
3Movie Review Data -- SentiWordNet
This page is a distribution site for movie-review data for use in sentiment-analysis experiments. Available are collections of movie-review documents labeled with respect to their overall sentiment polarity (positive or negative) or subjective rating (e.g., "two and a half stars") and sentences labeled with respect to their subjectivity status (subjective or objective) or polarity. These data sets were introduced in the following papers:
4 years ago by @ghagerer
show all tags
sentiwordnet
sentiment-analysis
dictionary-based
sentiwordnetsentiment-analysisdictionary-based
copydelete
- community post
- history of this post
1NLP Profiler - Profiling of Textual Dataset | Kaggle
Explore and run machine learning code with Kaggle Notebooks | Using data from multiple data sources
9 months ago by @ghagerer
show all tags
nlp-profiling
python
data-profiler
data-quality
nlp-profilingpythondata-profilerdata-quality
copydelete
- community post
- history of this post
1PCA — how to choose the number of components? | Bartosz Mikulski
In this article, I am going to show you how to choose the number of principal components when using principal component analysis for dimensionality reduction. In the first section, I am going to give you a short answer for those of you who are in a hurry and want to get something working. Later, I am going to provide a more extended explanation for those of you who are interested in understanding PCA.
a year ago by @ghagerer
show all tags
downprojection
dimension-reduction
unsupervised
pca
hyperparameter-optimization
downprojectiondimension-reductionunsupervisedpcahyperparameter-optimization
copydelete
- community post
- history of this post
1Perplexity in Language Models. Evaluating language models using the… | by Chiara Campagnola | Towards Data Science
Perplexity is a useful metric to evaluate models in Natural Language Processing (NLP). This article will cover the two ways in which it is normally defined and the intuitions behind them. A language…
a year ago by @ghagerer
show all tags
perplexity
entropy
llms
metrics
perplexityentropyllmsmetrics
copydelete
- community post
- history of this post
1Principal Component Analysis (PCA) with Scikit-learn
At the very beginning of the tutorial, I’ll explain the dimensionality of a dataset, what dimensionality reduction means, the main approaches to dimensionality reduction, the reasons for dimensionality reduction and what PCA means. Then, I will go deeper into the topic of PCA by implementing the PCA algorithm with the Scikit-learn machine learning library. This will help you to easily apply PCA to a real-world dataset and get results very fast.
a year ago by @ghagerer
show all tags
dimension-reduction
unsupervised
pca
hyperparameter-optimization
dimension-reductionunsupervisedpcahyperparameter-optimization
copydelete
- community post
- history of this post
1Python packages for assessing the quality of your data | by Sofia Pinto | Data science at Nesta | Medium
As data scientists, we spend a lot of our time doing exploratory data analysis (EDA), cleaning data and making sure the data we use to generate insights is of good quality. Have you ever found…
9 months ago by @ghagerer
show all tags
pandas
ydata-profiling
data-profiler
data-quality
toolkits
pandasydata-profilingdata-profilerdata-qualitytoolkits
copydelete
- community post
- history of this post
1RetinaNet: how Focal Loss fixes Single-Shot Detection
Why are single-shot methods like SSD and YOLO less accurate than region-proposal methods like R-CNN and how does focal loss fix this problem?
5 years ago by @ghagerer
show all tags
focal-loss
class-imbalance
focal-lossclass-imbalance
copydelete
- community post
- history of this post
1Should I normalize word2vec's word vectors before using them? - Cross Validated
When the downstream applications only care about the direction of the word vectors (e.g. they only pay attention to the cosine similarity of two words), then normalize, and forget about length. However, if the downstream applications are able to (or need to) consider more sensible aspects, such as word significance, or consistency in word usage (see below), then normalization might not be such a good idea.
4 years ago by @ghagerer
show all tags
normalization
word-vectors
word2vec
normalizationword-vectorsword2vec
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
⟩
⟩⟩

publications (hide)155
This list of posts may not be accurate to recent changes. If you want accurate posts, but with limited sorting follow this link.

display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

2A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases
J. Sequeda, D. Allemang, and B. Jacob. (2023)
9 months ago by @ghagerer
show all tags
llms
query-generation
SQL
databases
knowledge-graphs
llmsquery-generationSQLdatabasesknowledge-graphs
copydeleteadd this publication to your clipboard
7A correlated topic model of Science
D. Blei, and J. Lafferty. Annals of Applied Statistics, (2007)
4 years ago by @ghagerer
show all tags
correlated-topic-modeling
topic-modeling
correlated-topic-modelingtopic-modeling
copydeleteadd this publication to your clipboard
18A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
B. Pang, and L. Lee. Proceedings of the Association for Computational Linguistics (ACL), page 271--278. Association for Computational Linguistics, (2004)
4 years ago by @ghagerer
show all tags
sentiwordnet
sentiment-analysis
dictionary-based
sentiwordnetsentiment-analysisdictionary-based
copydeleteadd this publication to your clipboard
2A Survey Of Cross-lingual Word Embedding Models
S. Ruder, I. Vulic, and A. Sogaard. (2017)cite arxiv:1706.04902.
5 years ago by @ghagerer
show all tags
cross-lingual
embeddings
survey
cross-lingualembeddingssurvey
copydeleteadd this publication to your clipboard
2A survey on bias and fairness in machine learning
N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan. ACM Computing Surveys (CSUR), 54 (6): 1--35 (2021)
a year ago by @ghagerer
show all tags
bias
fair-ai
biasfair-ai
copydeleteadd this publication to your clipboard
6Active Semi-Supervision for Pairwise Constrained Clustering
S. Basu, A. Banerjee, and R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, page 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)
5 years ago by @ghagerer
show all tags
semi-supervised
clustering
pckmeans
kmeans
unsupervised
shabnam
semi-supervisedclusteringpckmeanskmeansunsupervisedshabnam
copydeleteadd this publication to your clipboard
3ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
L. Rauch, M. Aßenmacher, D. Huseljic, M. Wirth, B. Bischl, and B. Sick. (June 2023)
a year ago by @ghagerer
show all tags
dataset
pre-trained
active-learning
benchmarks
datasetpre-trainedactive-learningbenchmarks
copydeleteadd this publication to your clipboard
24Adam: A Method for Stochastic Optimization.
D. Kingma, and J. Ba. CoRR, (2014)
3 years ago by @ghagerer
show all tags
adam
optimizers
adamoptimizers
copydeleteadd this publication to your clipboard
2Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.
J. Duchi, E. Hazan, and Y. Singer. COLT, page 257-269. Omnipress, (2010)
3 years ago by @ghagerer
show all tags
adagrad
optimizers
adagradoptimizers
copydeleteadd this publication to your clipboard
2Advances in Quantitative Ethnography - First International Conference, ICQE 2019, Madison, WI, USA, October 20-22, 2019, Proceedings
B. Eagan, M. Misfeldt, and A. Siebert-Evenstone (Eds.) volume 1112 of Communications in Computer and Information Science, Springer, (2019)
3 years ago by @ghagerer
show all tags
conference
conference
copydeleteadd this publication to your clipboard
3AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images.
S. Albarqouni, C. Baur, F. Achilles, V. Belagiannis, S. Demirci, and N. Navab. IEEE Trans. Med. Imaging, 35 (5): 1313-1321 (2016)
4 years ago by @ghagerer
show all tags
annotation-bias
computer-vision
crowdsourcing
end-to-end
annotation-biascomputer-visioncrowdsourcingend-to-end
copydeleteadd this publication to your clipboard
2Aggregating and Predicting Sequence Labels from Crowd Annotations.
A. Nguyen, B. Wallace, J. Li, A. Nenkova, and M. Lease. ACL, page 299-309. Association for Computational Linguistics, (2017)
4 years ago by @ghagerer
show all tags
annotation-bias
nlp
crowdsourcing
sequence-labeling
annotation-biasnlpcrowdsourcingsequence-labeling
copydeleteadd this publication to your clipboard
1An analysis of the relationship between individuals? perceptions of privacy and mobile phone location data - a grounded theory study
A. Gorra. Leeds Metropolitan University, (April 2007)
3 years ago by @ghagerer
show all tags
grounded-theory
qualitative-research
grounded-theoryqualitative-research
copydeleteadd this publication to your clipboard
2An online tool for analyzing written student feedback.
N. Grönberg, A. Knutas, T. Hynninen, and M. Hujala. Koli Calling, page 40:1-40:2. ACM, (2020)
4 years ago by @ghagerer
show all tags
palaute
text-mining
education
palautetext-miningeducation
copydeleteadd this publication to your clipboard
4An Unsupervised Neural Attention Model for Aspect Extraction.
R. He, W. Lee, H. Ng, and D. Dahlmeier. ACL (1), page 388-397. Association for Computational Linguistics, (2017)
5 years ago by @ghagerer
show all tags
anjali
clustering
word-vectors
attention-based-aspect-extraction
word2vec
topic-modeling
anjaliclusteringword-vectorsattention-based-aspect-extractionword2vectopic-modeling
copydeleteadd this publication to your clipboard
2Analyzing educational comments for topics and sentiments: A text analytics approach.
G. Nitin, S. Gottipati, and V. Shankararaman. 2015 IEEE Frontiers in Education Conference (FIE), page 1-9. IEEE Computer Society, (2015)
3 years ago by @ghagerer
show all tags
educational
system
sentiment-analysis
topic-modeling
demo
sfms
educationalsystemsentiment-analysistopic-modelingdemosfms
copydeleteadd this publication to your clipboard
4Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned.
E. Voita, D. Talbot, F. Moiseev, R. Sennrich, and I. Titov. (2019)cite arxiv:1905.09418Comment: ACL 2019 (camera-ready).
5 years ago by @ghagerer
show all tags
pruning
transformer
pruningtransformer
copydeleteadd this publication to your clipboard
3Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings.
D. Demszky, N. Garg, R. Voigt, J. Zou, J. Shapiro, M. Gentzkow, and D. Jurafsky. NAACL-HLT (1), abs/1904.01596, page 2970-3005. Association for Computational Linguistics, (2019)
5 years ago by @ghagerer
show all tags
media-agenda-setting
topic-modeling
media-agenda-settingtopic-modeling
copydeleteadd this publication to your clipboard
1‘We (don’t) know how you feel’--a comparative study of automated vs. manual analysis of social media conversations
A. Canhoto, and Y. Padmanabhan. Journal of Marketing Management, 31 (9-10): 1141--1157 (2015)
5 years ago by @ghagerer
show all tags
inter-rater-agreement
human-vs-machine
social-media
sentiment-analysis
inter-rater-agreementhuman-vs-machinesocial-mediasentiment-analysis
copydeleteadd this publication to your clipboard
1“President Vows to Cut <Taxes> Hair”: Dataset and Analysis of Creative Text Editing for Humorous Headlines
N. Hossain, J. Krumm, and M. Gamon. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), page 133--142. (2019)
4 years ago by @ghagerer
show all tags
bag-of-words
bag-of-words
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩

bookmarks (hide)64 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

Dr. Gerhard Johann Hagerer

discussion

similar users

shared groups

tags

bookmarks (hide)64
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML