ghagerer | BibSonomy

bookmarks (58)

About « SenticNet
The main aim of SenticNet is to make the conceptual and affective information conveyed by natural language (meant for human consumption) more easily-accessible to machines.
4 years ago by @ghagerer
tags: senticnet, sentics, sentiment-analysis

Advanced RAG with Knowledge Graphs (Neo4J demo)
I recently created a demo for some prospective clients of mine, demonstrating how to use Large Language Models (LLMs) together with graph databases like Neo4J. The two have a lot of interesting interactions, namely that you can now create knowledge graphs more easily than ever before, by having AI find the graph entities and relationships from your unstructured data, rather than having to do all that manually. On top of that, graph databases also have some advantages for Retrieval Augmented Generation (RAG) applications compared to vector search, which is currently the prevailing approach to RAG.
6 months ago by @ghagerer
tags: youtube, rag, llms, knowledge-graphs

AI Trained on Old Scientific Papers Makes Discoveries Humans Missed - VICE
Scientists used machine learning to reveal new scientific knowledge hidden in old research papers.
5 years ago by @ghagerer
tags: scientific-papers, vice, word-vectors, word2vec

All the Hard Stuff Nobody Talks About when Building Products with LLMs | Honeycomb
In this post, Phillip talks through the challenges & pitfalls of LLMs we faced when building our Query Assistant - and that you too may face.
12 months ago by @ghagerer
tags: prompt-engineering, llms, deployment

BERT for unsupervised text tasks - ETHER Labs - Medium
Document embeddings and sentence relatedness using BERT
4 years ago by @ghagerer
tags: sentence-embeddings, bert, sentence-relatedness, document-embeddings

BERT Vector Space shows issues with unknown words · Issue #164 · google-research/bert · GitHub
I'm not sure what these vectors are, since BERT does not generate meaningful sentence vectors. It seems that this is doing average pooling over the word tokens to get a sentence vector, but we never suggested that this will generate meaningful sentence representations. And even if they are decent representations when fed into a DNN trained for a downstream task, it doesn't mean that they will be meaningful in terms of cosine distance. (Since cosine distance is a linear space where all dimensions are weighted equally).
5 years ago by @ghagerer
tags: cls, sentence-embeddings, bert

Bhattacharyya distance - Wikipedia
In statistics, the Bhattacharyya distance measures the similarity of two probability distributions. It is closely related to the Bhattacharyya coefficient which is a measure of the amount of overlap between two statistical samples or populations. Both measures are named after Anil Kumar Bhattacharya, a statistician who worked in the 1930s at the Indian Statistical Institute.[1] The coefficient can be used to determine the relative closeness of the two samples being considered. It is used to measure the separability of classes in classification and it is considered to be more reliable than the Mahalanobis distance, as the Mahalanobis distance is a particular case of the Bhattacharyya distance when the standard deviations of the two classes are the same. Consequently, when two classes have similar means but different standard deviations, the Mahalanobis distance would tend to zero, whereas the Bhattacharyya distance grows depending on the difference between the standard deviations.
4 years ago by @ghagerer
tags: probability-distribution-similarity

Building a Document-based Question Answering System with LangChain, Pinecone, and LLMs like GPT-4 and ChatGPT
Build document-based question-answering systems using LangChain, Pinecone, LLMs like GPT-4, and semantic search for precise, context-aware AI solutions.
a year ago by @ghagerer
tags: chatbots, llms, langchain, question-answering

Building a Private ChatGPT Interface With Azure OpenAI – Baldacchino Automation
https://automation.baldacchino.net/building-a-private-chatgpt-interface-with-azure-openai/
9 months ago by @ghagerer
tags: cloud, llms, gpt3, chatgpt, azure

Chatbot Analytics: 9 Key Metrics You Must Track in 2023
The ultimate guide to chatbot analytics. Find out what bot metrics and KPIs you should measure and discover easy ways to optimize your chatbot performance.
12 months ago by @ghagerer
tags: Chatgpt, evaluation, metrics, kpis

Contextual Topic Identification - Insight Data Science
Identifying meaningful topics for sparse Steam reviews -- by Steve Shao
4 years ago by @ghagerer
tags: umap, auto-encoder, transfer-learning, clustering, bert, topic-modeling

Data Augmentation in NLP - Towards Data Science
In natural language processing (NLP), augmenting text is hard because of the high complexity of language. Not every word can be replaced by another, for example a, an, or the. Also, not every word has a synonym. Changing even a single word can make the context totally different. On the other hand, generating augmented images in computer vision is relatively easy. Even after introducing noise or cropping out a portion of the image, a model can still classify it.
4 years ago by @ghagerer
tags: data-augmentation

Dynamic Few-Shot Prompting: Overcoming Context Limit for ChatGPT Text Classification | by Iryna Kondrashchenko | Jun, 2023 | Medium
Recent explosion in the popularity of large language models like ChatGPT has led to their increased usage in classical NLP tasks like language classification. This involves providing a context…
11 months ago by @ghagerer
tags: scikit-learn, few-shot, llms, gpt3, zero-shot, classification

Effective Methods Against LLM Hallucination | by Hooman Sedghamiz | Aug, 2023 | Medium
Large language models (LLMs) have proven to be valuable tools, but they often lack reliability. Many instances have surfaced where LLM-generated responses included false information. Specifically…
7 months ago by @ghagerer
tags: llms, hallucinations

Efficient Extractive Question Answering on CPU using QUIP | by Zachariah Zhang | Medium
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
14 days ago by @ghagerer
tags: scoring, retriever, question-answering, extractive

Emerging Architectures for LLM Applications | Andreessen Horowitz
A reference architecture for the LLM app stack. It shows the most common systems, tools, and design patterns used by AI startups and tech companies.
10 months ago by @ghagerer
tags: cloud, llms, langchain, architecture

Exploratory Data Analysis Using D-Tale
D-Tale is an interactive web-based library that consists of a Flask backend and a React front-end serving as an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently, this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.
4 years ago by @ghagerer
tags: pandas, development, python, visualization, data-science

FastText and Gensim word embeddings | RARE Technologies
Facebook Research open sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering fastText embeddings are an extension of word2vec.
5 years ago by @ghagerer
tags: comparison, fasttext, word2vec

Finally Remember Precision and Recall | by Jakub Rysavy | Towards Data Science
Can't remember what precision and recall (sensitivity) are? Why is accuracy not enough? Read the explanation, illustrated with a confusion matrix example.
11 months ago by @ghagerer
tags: precision, recall, metrics

Fine Tuning Large Language Models on Azure Machine Learning | by Keshav Singh | Jul, 2023 | Dev Genius
Ever since Nov 2022, when Microsoft and OpenAI announced ChatGPT, the LLM space has been revolutionized and democratized. The demand to adopt the technology and apply it to the diverse use cases across…
11 months ago by @ghagerer
tags: cloud, llms, huggingface, open-source, fine-tuning, azure

publications (148)

A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases
J. Sequeda, D. Allemang, and B. Jacob. (2023)
6 months ago by @ghagerer
tags: llms, query-generation, SQL, databases, knowledge-graphs

A correlated topic model of Science
D. Blei, and J. Lafferty. Annals of Applied Statistics, (2007)
4 years ago by @ghagerer
tags: correlated-topic-modeling, topic-modeling

A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
B. Pang, and L. Lee. Proceedings of the Association for Computational Linguistics (ACL), page 271--278. Association for Computational Linguistics, (2004)
4 years ago by @ghagerer
tags: sentiwordnet, sentiment-analysis, dictionary-based

A Survey Of Cross-lingual Word Embedding Models
S. Ruder, I. Vulić, and A. Søgaard. (2017). arXiv:1706.04902.
5 years ago by @ghagerer
tags: cross-lingual, embeddings, survey

A survey on bias and fairness in machine learning
N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan. ACM Computing Surveys (CSUR), 54 (6): 1--35 (2021)
a year ago by @ghagerer
tags: bias, fair-ai

Active Semi-Supervision for Pairwise Constrained Clustering
S. Basu, A. Banerjee, and R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, page 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)
5 years ago by @ghagerer
tags: semi-supervised, clustering, pckmeans, kmeans, unsupervised, shabnam

ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
L. Rauch, M. Aßenmacher, D. Huseljic, M. Wirth, B. Bischl, and B. Sick. (June 2023)
11 months ago by @ghagerer
tags: dataset, pre-trained, active-learning, benchmarks

Adam: A Method for Stochastic Optimization.
D. Kingma, and J. Ba. CoRR, (2014)
3 years ago by @ghagerer
tags: adam, optimizers

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.
J. Duchi, E. Hazan, and Y. Singer. COLT, page 257-269. Omnipress, (2010)
3 years ago by @ghagerer
tags: adagrad, optimizers

Advances in Quantitative Ethnography - First International Conference, ICQE 2019, Madison, WI, USA, October 20-22, 2019, Proceedings
B. Eagan, M. Misfeldt, and A. Siebert-Evenstone (Eds.) volume 1112 of Communications in Computer and Information Science, Springer, (2019)
3 years ago by @ghagerer
tags: conference

AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images.
S. Albarqouni, C. Baur, F. Achilles, V. Belagiannis, S. Demirci, and N. Navab. IEEE Trans. Med. Imaging, 35 (5): 1313-1321 (2016)
4 years ago by @ghagerer
tags: annotation-bias, computer-vision, crowdsourcing, end-to-end

Aggregating and Predicting Sequence Labels from Crowd Annotations.
A. Nguyen, B. Wallace, J. Li, A. Nenkova, and M. Lease. ACL, page 299-309. Association for Computational Linguistics, (2017)
4 years ago by @ghagerer
tags: annotation-bias, nlp, crowdsourcing, sequence-labeling

An analysis of the relationship between individuals' perceptions of privacy and mobile phone location data - a grounded theory study
A. Gorra. Leeds Metropolitan University, (April 2007)
3 years ago by @ghagerer
tags: grounded-theory, qualitative-research

An online tool for analyzing written student feedback.
N. Grönberg, A. Knutas, T. Hynninen, and M. Hujala. Koli Calling, page 40:1-40:2. ACM, (2020)
3 years ago by @ghagerer
tags: palaute, text-mining, education

An Unsupervised Neural Attention Model for Aspect Extraction.
R. He, W. Lee, H. Ng, and D. Dahlmeier. ACL (1), page 388-397. Association for Computational Linguistics, (2017)
5 years ago by @ghagerer
tags: anjali, clustering, word-vectors, attention-based-aspect-extraction, word2vec, topic-modeling

Analyzing educational comments for topics and sentiments: A text analytics approach.
G. Nitin, S. Gottipati, and V. Shankararaman. 2015 IEEE Frontiers in Education Conference (FIE), page 1-9. IEEE Computer Society, (2015)
3 years ago by @ghagerer
tags: educational, system, sentiment-analysis, topic-modeling, demo, sfms

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned.
E. Voita, D. Talbot, F. Moiseev, R. Sennrich, and I. Titov. (2019). arXiv:1905.09418. Comment: ACL 2019 (camera-ready).
5 years ago by @ghagerer
tags: pruning, transformer

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings.
D. Demszky, N. Garg, R. Voigt, J. Zou, J. Shapiro, M. Gentzkow, and D. Jurafsky. NAACL-HLT (1), abs/1904.01596, page 2970-3005. Association for Computational Linguistics, (2019)
5 years ago by @ghagerer
tags: media-agenda-setting, topic-modeling

‘We (don’t) know how you feel’--a comparative study of automated vs. manual analysis of social media conversations
A. Canhoto, and Y. Padmanabhan. Journal of Marketing Management, 31 (9-10): 1141--1157 (2015)
5 years ago by @ghagerer
tags: inter-rater-agreement, human-vs-machine, social-media, sentiment-analysis

“President Vows to Cut <Taxes> Hair”: Dataset and Analysis of Creative Text Editing for Humorous Headlines
N. Hossain, J. Krumm, and M. Gamon. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), page 133--142. (2019)
4 years ago by @ghagerer
tags: bag-of-words

Dr. Gerhard Johann Hagerer
