A little insane, but in a good way.
If I remember correctly, the properties the API returns are `comment_score` and `post_score`.
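If you want to pull them out yourself, something like this works on the JSON the user endpoint returns. The field names and response shape here are from memory, so treat them as an assumption and check your instance's API docs:

```python
def extract_karma(user_response: dict) -> dict:
    """Pull the aggregate scores out of a Lemmy /api/v3/user response.

    Assumes the scores live under person_view.counts, which is my best
    recollection of the v3 response shape.
    """
    counts = user_response["person_view"]["counts"]
    return {
        "comment_score": counts["comment_score"],
        "post_score": counts["post_score"],
    }
```

Summing the two gives you a Reddit-style total karma number.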
Lemmy does have karma: it's stored in the DB, and the API returns it. It just isn't displayed in the UI.
It only handles HTML currently, but I like your idea, thank you! I'll look into implementing PDF reading as well. One problem with scientific articles, however, is that they are often quite long and don't fit into the model's context. I would need to do recursive summarization, which would use many more tokens and could become pretty expensive. (Of course, the same problem occurs if a web page is too long; currently I just truncate it, which is a rather barbaric solution.)
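For the curious, the recursive approach is roughly this: split, summarize each chunk, concatenate the partial summaries, and recurse until the result fits in one call. This is just a sketch with a character-based stand-in for a real token limit, and `summarize` is a hypothetical callable wrapping the LLM call:

```python
CHUNK_CHARS = 8000  # placeholder; a real implementation would count tokens

def summarize_long(text: str, summarize) -> str:
    """Recursively condense text until it fits into a single model call.

    `summarize` is an injected function that sends one chunk to the LLM
    and returns its summary (not shown here).
    """
    if len(text) <= CHUNK_CHARS:
        return summarize(text)
    chunks = [text[i:i + CHUNK_CHARS] for i in range(0, len(text), CHUNK_CHARS)]
    partial_summaries = "\n".join(summarize(chunk) for chunk in chunks)
    return summarize_long(partial_summaries, summarize)
```

You can see why it gets expensive: a long paper means one LLM call per chunk, plus further calls for each level of recursion.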
I think the incentives are a bit different here. If we can keep the threadiverse nonprofit, and contribute to the maintenance costs of the servers, it might stay a much friendlier place than Reddit.
Lemmy actually has a really good API. Moderation tools are pretty simple though.
Did I miss something? Or is this still about Beehaw?
Made the switch 4 years ago. No regrets.
This describes 99% of AI startups.
The company I work for was considering using Mendable for AI-powered documentation search. In a day, I built a prototype using OpenAI embeddings and GPT-3.5 that was just as good as their product. They didn't buy Mendable :)
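The core of the prototype really is that small. Embed each doc chunk, embed the query, rank by cosine similarity, and paste the top chunks into the GPT-3.5 prompt. Here's a sketch of the retrieval step with the embedding calls stubbed out (in the real thing they were OpenAI's embedding endpoint):

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_chunks(query_vec, chunk_vecs, chunks, k=3):
    """Return the k doc chunks most similar to the query embedding."""
    ranked = sorted(
        zip(chunks, chunk_vecs),
        key=lambda pair: cosine(query_vec, pair[1]),
        reverse=True,
    )
    return [chunk for chunk, _ in ranked[:k]]
```

The retrieved chunks then go into the prompt as context, and the model answers from them.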
First, thank you for the detailed response.
Second, I think you finally convinced me to delete my FB. I will link to this comment wherever possible to show people what a terrible company Meta is.
After all, they said we need quality content to attract new users
They got gregnant
I’m the author of that bot. It will have an opt-out option, I implemented it as soon as someone suggested it:
https://programming.dev/comment/305938
Don’t spread sensationalist lies.
Oh wow, I’ve just realized it was OP I talked to in the comments. I immediately replied to their suggestion. What a clown 🤡
Can you tell us more about what they are like?
Thank you, that’s a reasonable suggestion, I added it to the comment template:
TL;DR: (AI-generated 🤖)
Yes, they have promised explicitly not to use API data for training.
Thank you, I’ll take a look at these models, I hope I can find something a bit cheaper but still high-quality.
I implemented it. The feature will be available right from the start. The bot will reply with this if the user has disabled it:
🔒 The author of this post or comment has the #nobot hashtag in their profile. Out of respect for their privacy settings, I am unable to summarize their posts or comments.
Oh, I've just realized that it's also possible even if the video doesn't have a transcript. You can download the audio, feed it into OpenAI Whisper (which is currently the best available audio transcription model), and pass the transcript to the LLM. And Whisper isn't even too expensive.
Not sure about the legality of it though.
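The pipeline would look roughly like this. I've injected the actual API calls as callables to keep the sketch self-contained; in practice `transcribe` would wrap the whisper-1 transcription endpoint and `summarize` a chat-completion call, and the audio download would be something like yt-dlp:

```python
def summarize_video(audio_path: str, transcribe, summarize) -> str:
    """Audio file -> Whisper transcript -> LLM summary.

    `transcribe` and `summarize` are hypothetical callables standing in
    for the real API calls, which aren't shown here.
    """
    transcript = transcribe(audio_path)
    prompt = f"Summarize this video transcript:\n\n{transcript}"
    return summarize(prompt)
```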
LLMs can do a surprisingly good job even if the text extracted from the PDF isn’t in the right reading order.
Another thing I've noticed is that figures are usually explained thoroughly in the text, so there is no need for the model to see them to generate a good summary. Human communication is very redundant, and we don't realize it.