News

This dataset, LAION-400M, contains 413M image-text pairs and has subsequently been used "in many papers and experiments." The new dataset, LAION-5B, was collected using a three-stage pipeline.
Safeguards to keep kids’ data away from AI. When LAION-5B was introduced in spring 2022, it was described as an attempt to replicate OpenAI's dataset and touted as "the largest freely available ...
Databricks Inc. has acquired Lilac AI Inc., a startup with a tool that helps developers manage the text datasets they use in artificial intelligence projects. The companies announced the deal tod.
A good dataset for use in text analysis requires thought and consideration beyond tweaking filters in an application. In this webinar led by Constellate, the text analytics service from the ...