John Snow Labs Releases Spark-NLP 3.4.0
Jan. 6, 2022 -- John Snow Labs is excited to release Spark NLP 3.4.0.
Spark NLP 3.4.0 extends the support for Apache Spark 3.2.x major releases on Scala 2.12. The company now supports all 5 major Apache Spark and PySpark releases of 2.3.x, 2.4.x, 3.0.x, 3.1.x, and 3.2.x at once helping the community to migrate from earlier Apache Spark versions to newer releases without being worried about Spark NLP end of life support. This release also extends support for new Databricks and EMR instances on Spark 3.2.x clusters.
This release also comes with a brand new GPT2Transformer using OpenAI GPT-2 models for prediction at scale, new ALBERT, XLNet, RoBERTa, XLM-RoBERTa, and Longformer annotators to use existing or fine-tuned models for Sequence Classification, new distributed and trainable Word2Vec annotators, new state-of-the-art transformer models in many languages, a new param to useBestModel in NerDL during training, bug fixes, and lots more!
As always, John Snow Labs would like to thank its community for their feedback, questions, and feature requests.
To learn more about the release, including major features and improvements, bug fixes and installation details, read the release post here.
Source: John Snow Labs