nsa

@ nsa @kbin.social

Posts

13
Comments

8
Joined

3 yr. ago

Machine Learning @kbin.social
nsa @kbin.social
3y ago

What's In My Big Data?

arxiv.org /abs/2310.20707

0
Machine Learning @kbin.social
nsa @kbin.social
3y ago

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

arxiv.org /abs/2310.16787

0
Machine Learning @kbin.social
nsa @kbin.social
3y ago

GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems

arxiv.org /abs/2310.12397

0
Machine Learning @kbin.social
nsa @kbin.social
3y ago

A Long Way to Go: Investigating Length Correlations in RLHF

arxiv.org /abs/2310.03716

0
Machine Learning @kbin.social
nsa @kbin.social
3y ago

Think before you speak: Training Language Models With Pause Tokens

arxiv.org /abs/2310.02226

1
Machine Learning @kbin.social
nsa @kbin.social
3y ago

Language Modeling Is Compression

arxiv.org /abs/2309.10668

0
Machine Learning @kbin.social
nsa @kbin.social
3y ago

Retentive Network: A Successor to Transformer for Large Language Models

arxiv.org /abs/2307.08621

4
Machine Learning @kbin.social
nsa @kbin.social
3y ago

CoDi: Generate Anything from Anything All At Once through Composable Diffusion

codi-gen.github.io

0
3y ago

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing
Jump
nsa @kbin.social 3y ago
That's appreciated!

3y ago

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

If there isn't any discussion on reddit (no discussion in this case), I don't see a reason to link to reddit; you can just link to the project page. That said, if you think there is important discussion happening that is helpful for understanding the paper, then use a teddit link instead, like:

https://teddit.net/r/MachineLearning/comments/14pq5mq/r_hardwiring_vit_patch_selectivity_into_cnns/

nsa

@ nsa @kbin.social

Posts

13
Comments

8
Joined

3 yr. ago

nsa

What's In My Big Data?

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems

A Long Way to Go: Investigating Length Correlations in RLHF

Think before you speak: Training Language Models With Pause Tokens

Language Modeling Is Compression

Retentive Network: A Successor to Transformer for Large Language Models

CoDi: Generate Anything from Anything All At Once through Composable Diffusion

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Extending Context Window of Large Language Models via Positional Interpolation

Extending Context Window of Large Language Models via Positional Interpolation

@machinelearning am I in the right place? Lol

Extending Context Window of Large Language Models via Positional Interpolation

Inverse Scaling: When Bigger Isn't Better

Craft an Iron Sword: Dynamically Generating Interactive Game Characters by Prompting Large Language Models Tuned on Code

r/MachineLearning finally received a warning from u/ModCodeOfConduct

The Curse of Recursion: Training on Generated Data Makes Models Forget