In mathematics, the Wasserstein or Kantorovich–Rubinstein metric or distance is a distance function defined between probability distributions on a given metric space M.
Intuitively, if each distribution is viewed as a unit amount of "dirt" piled on M, the metric is the minimum "cost" of turning one pile into the other, which is assumed to be the amount of dirt that needs to be moved times the mean distance it has to be moved. Because of this analogy, the metric is known in computer science as the earth mover's distance.
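For one-dimensional distributions given by samples, SciPy ships an implementation; a minimal sketch (the Gaussian samples are just a toy illustration):

```python
# Minimal sketch of the 1-D Wasserstein (earth mover's) distance with SciPy.
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(0)
u = rng.normal(loc=0.0, scale=1.0, size=1000)  # samples from N(0, 1)
v = rng.normal(loc=2.0, scale=1.0, size=1000)  # samples from N(2, 1)

# For two Gaussians with equal variance, W1 equals the distance between
# their means, so this should come out close to 2.0.
print(wasserstein_distance(u, v))
```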
We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* of the quality of OpenAI's ChatGPT and Google Bard while outperforming other models such as LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.
Kedro versioned datasets can be mixed with incremental and partitioned datasets. (Unsure what Kedro is? Check out this post.) This was a question presented to…
In previous articles, we explored how Snowpark Container Services can open doors to a complete data stack running solely on Snowflake (here) and showcased all essential tools Snowflake provides to achieve this (here). Now, it’s time to dive into the practical side of things. This article will guide you through a step-by-step implementation of running dbt in Snowpark Container Services, covering everything from setup and containerisation all the way to scheduling and monitoring. If you’re trying to create a simple containerised dbt setup, this guide will help you put all theory into action!
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
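As a rough sketch of the Reader side, an off-the-shelf extractive-QA model can be run through the Hugging Face transformers pipeline (the checkpoint name here is one publicly available choice, not necessarily the one used in this post):

```python
# Reader step of a Retriever-Reader QA setup, sketched with the Hugging Face
# transformers pipeline. The checkpoint is a publicly available extractive-QA
# model, used here purely for illustration.
from transformers import pipeline

reader = pipeline("question-answering", model="deepset/roberta-base-squad2")

context = ("BERT was released by Google in 2018 and quickly became a "
           "standard baseline for extractive question answering.")
result = reader(question="When was BERT released?", context=context)

# The reader returns the answer span, its score, and character offsets.
print(result["answer"], result["score"], result["start"], result["end"])
```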
In natural language understanding (NLU) tasks, there is a hierarchy of lenses through which we can extract meaning — from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics. The process of learning, recognizing, and extracting these topics across a collection of documents is called topic modeling.
In this post, we will explore topic modeling through 4 of the most popular techniques today: LSA, pLSA, LDA, and the newer, deep learning-based lda2vec.
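To give a taste of what these techniques look like in practice, here is a minimal sketch of one of the four (LDA) using scikit-learn; the corpus and hyperparameters are toy placeholders:

```python
# Toy LDA topic model with scikit-learn: fit two topics on four short
# documents and print each topic's top words.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "the cat sat on the mat with another cat",
    "dogs and cats make friendly pets",
    "stock markets fell sharply on friday",
    "investors fear more market volatility",
]

vec = CountVectorizer(stop_words="english")
counts = vec.fit_transform(docs)          # LDA expects raw term counts

lda = LatentDirichletAllocation(n_components=2, random_state=0)
doc_topics = lda.fit_transform(counts)    # per-document topic mixtures

vocab = vec.get_feature_names_out()
for k, weights in enumerate(lda.components_):
    top = [vocab[i] for i in weights.argsort()[-3:][::-1]]
    print(f"topic {k}: {top}")
```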
Facebook Research open-sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering that fastText embeddings are an extension of word2vec.
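Gensim implements both models behind a near-identical API, which makes the comparison easy to sketch (the corpus below is a toy stand-in for real training data):

```python
# Side-by-side word2vec vs. fastText sketch using gensim's implementations.
from gensim.models import FastText, Word2Vec

sentences = [
    ["machine", "learning", "is", "fun"],
    ["deep", "learning", "extends", "machine", "learning"],
    ["word", "embeddings", "capture", "meaning"],
] * 100  # repeated so the toy corpus has enough occurrences to train on

w2v = Word2Vec(sentences, vector_size=50, min_count=1, epochs=5, seed=0)
ft = FastText(sentences, vector_size=50, min_count=1, epochs=5, seed=0)

# word2vec can only look up words it saw during training ...
print(w2v.wv.most_similar("learning", topn=3))

# ... while fastText composes vectors from character n-grams, so it can
# embed a word that never appeared in the corpus.
print(ft.wv["learnings"][:5])
```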
You want to discern how many clusters there are (or, if you prefer, how many Gaussian components generated the data), and you have no information about the "ground truth". A realistic case, where the data do not have the nicety of behaving as well as simulated ones.
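One standard way to make that call without ground truth is to fit Gaussian mixtures of increasing size and compare an information criterion such as BIC; a minimal sketch (the data are simulated here only so the example is self-contained, and the article's own criterion may differ):

```python
# Choose the number of Gaussian components by BIC, with no ground-truth labels.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=[0, 0], scale=0.5, size=(200, 2)),
    rng.normal(loc=[4, 4], scale=0.7, size=(200, 2)),
    rng.normal(loc=[0, 5], scale=0.6, size=(200, 2)),
])

# Fit mixtures with 1..6 components and keep the BIC of each.
bics = {k: GaussianMixture(n_components=k, random_state=0).fit(X).bic(X)
        for k in range(1, 7)}
best_k = min(bics, key=bics.get)  # lower BIC is better
print(bics)
print("chosen number of components:", best_k)  # expect 3
```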
Definition of NLP coherence scores, in particular the intrinsic UMass measure and PMI.
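As commonly written (with D(w) the number of documents containing w, and D(w_i, w_j) the number containing both), the two measures are:

```latex
% UMass coherence of a topic's top words w_1, ..., w_M (Mimno et al., 2011):
C_{\mathrm{UMass}} = \sum_{m=2}^{M} \sum_{l=1}^{m-1}
    \log \frac{D(w_m, w_l) + 1}{D(w_l)}

% Pointwise mutual information of a word pair, with probabilities
% estimated from a reference corpus (used by the PMI-based measures):
\mathrm{PMI}(w_i, w_j) = \log \frac{p(w_i, w_j)}{p(w_i)\, p(w_j)}
```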
The fact that human judgment is not correlated with perplexity (or with the likelihood of unseen documents) is the motivation for further work trying to model human judgment directly. This is in itself a hard task, as human judgment is not clearly defined; for example, two experts can disagree on the usefulness of a topic.
The methods addressing this problem can be classified into two categories: intrinsic methods, which do not use any external source or task beyond the dataset itself, and extrinsic methods, which use the discovered topics for external tasks, such as information retrieval [Wei06], or use external statistics to evaluate the topics.
Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualisation similarly to t-SNE, but also for general non-linear dimension reduction. The algorithm is founded on three assumptions about the data:
The data is uniformly distributed on a Riemannian manifold;
The Riemannian metric is locally constant (or can be approximated as such);
The manifold is locally connected.
From these assumptions it is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low dimensional projection of the data that has the closest possible equivalent fuzzy topological structure.
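In practice, the reference implementation (the umap-learn package) exposes this through a scikit-learn-style API; a minimal sketch, with the digits dataset as a stand-in:

```python
# 2-D UMAP embedding of a small benchmark dataset using umap-learn.
import umap
from sklearn.datasets import load_digits

X, y = load_digits(return_X_y=True)

# n_neighbors balances local vs. global structure; min_dist controls how
# tightly points are allowed to pack in the low-dimensional embedding.
reducer = umap.UMAP(n_neighbors=15, min_dist=0.1, n_components=2,
                    random_state=42)
embedding = reducer.fit_transform(X)  # shape (n_samples, 2)
print(embedding.shape)
```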
MACE (Multi-Annotator Competence Estimation) is an implementation of an item-response model that lets you evaluate redundant annotations of categorical data. It provides competence estimates for the individual annotators and the most likely answer to each item.
If we have 10 annotators answer a question, and five answer with 'yes' and five with 'no' (a surprisingly frequent event), we would normally have to flip a coin to decide what the right answer is. If we knew, however, that one of the people who answered 'yes' is an expert on the question, while one of the others just always selects 'no', we would take this information into account and weight their answers accordingly. MACE does exactly that. It tries to find out which annotators are more trustworthy and upweights their answers. All you need to provide is a CSV file with one item per line.
In tests, MACE's trust estimates correlated highly with the annotators' true competence, and it achieved accuracies of over 0.9 on several test sets. MACE can take annotated items into account, if they are available. This helps to guide the training and improves accuracy.
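This is not MACE's actual Bayesian model, but the weighting intuition can be sketched in a few lines: alternately estimate the answers and each annotator's competence, and let trusted annotators count for more (binary labels and a fully annotated toy matrix are assumed here):

```python
# Toy illustration of competence-weighted aggregation, NOT MACE itself.
import numpy as np

# rows = items, columns = annotators; binary labels, fully annotated
votes = np.array([
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 1, 0],
    [0, 1, 0, 0],
])

# Start from the majority vote, then iterate.
answers = (votes.mean(axis=1) >= 0.5).astype(int)
for _ in range(10):
    # competence = how often each annotator agrees with the current answers
    competence = np.clip((votes == answers[:, None]).mean(axis=0), 0.01, 0.99)
    # weight each vote by the log-odds of its annotator's competence
    w = np.log(competence / (1 - competence))
    # weighted vote: map labels {0,1} to {-1,+1} and take the sign
    answers = (((2 * votes - 1) * w).sum(axis=1) > 0).astype(int)

print("competence:", np.round(competence, 2))  # the always-'0' annotator sinks
print("answers:   ", answers)
```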
The main aim of SenticNet is to make the conceptual and affective information conveyed by natural language (meant for human consumption) more easily accessible to machines.
H. Zhuang, T. Hanratty, and J. Han. Proceedings of the 2019 SIAM International Conference on Data Mining, Society for Industrial and Applied Mathematics, (May 2019)
Z. Yang, D. Yang, C. Dyer, X. He, A. Smola, and E. Hovy. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1480–1489. (2016)
M. Windl, V. Winterhalter, A. Schmidt, and S. Mayer. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, New York, NY, USA, Association for Computing Machinery, (2023)