nable to execute HTTP request: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException

### Is there an existing issue for this?

- [x] I have searched the existing issues and did not find a match.

### Who can help?

_No response_

### What are you working on?

Dear sir,
I referenced your code to import pipeline, but encountered an error,

### Current Behavior

I run the following script in Azure Databricks workspace , An error occurred
"""
import sparknlp
import os
from pyspark.sql import SparkSession
from pyspark.conf import SparkConf

os.environ['JAVA_HOME'] = 'C:\\Program Files\\Eclipse Adoptium\\jdk-11.0.27.6-hotspot' 
os.environ['HADOOP_HOME'] = 'C:\\hadoop'
os.environ['SPARK_LOCAL_DIRS'] = 'C:\\spark_temp'

spark = sparknlp.start(gpu=False,
                       apple_silicon=False,
                       aarch64=False,
                       memory="16G",
                       cache_folder="",
                       log_folder="",
                       cluster_tmp_dir="",
                       params={"spark.jars.repositories": "http://s3.amazonaws.com/auxdata.johnsnowlabs.com"},
                       real_time_output=False,
                       output_level=1)
 
print("Spark NLP version: ", sparknlp.version())
print("Apache Spark version: ", spark.version)
 
from pyspark.ml import Pipeline
from sparknlp.annotator import UniversalSentenceEncoder
from sparknlp.common import *
from sparknlp.base import *
 
newstestdataset = spark.read \
    .option("header", True) \
    .option("inferSchema", True) \
    .csv("test_data.csv")

newstestdataset.show(10)

document = DocumentAssembler() \
    .setInputCol("description") \
    .setOutputCol("document")

use = UniversalSentenceEncoder.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("sentence_embeddings")

pipeline = Pipeline(stages=[document, use])
testdataset = pipeline.fit(newstestdataset).transform(newstestdataset)

testdataset.show()

spark.stop()

“”“
py4j.protocol.Py4JJavaError: An error occurred while calling z:com.johnsnowlabs.nlp.pretrained.PythonResourceDownloader.getDownloadSize.
: com.amazonaws.SdkClientException: Unable to execute HTTP request: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target


### Expected Behavior

The scripts are expected to run successfully.

### Steps To Reproduce

The error script has been attached
"""
import sparknlp
import os
from pyspark.sql import SparkSession
from pyspark.conf import SparkConf

os.environ['JAVA_HOME'] = 'C:\\Program Files\\Eclipse Adoptium\\jdk-11.0.27.6-hotspot' 
os.environ['HADOOP_HOME'] = 'C:\\hadoop'
os.environ['SPARK_LOCAL_DIRS'] = 'C:\\spark_temp'

spark = sparknlp.start(gpu=False,
                       apple_silicon=False,
                       aarch64=False,
                       memory="16G",
                       cache_folder="",
                       log_folder="",
                       cluster_tmp_dir="",
                       params={"spark.jars.repositories": "http://s3.amazonaws.com/auxdata.johnsnowlabs.com"},
                       real_time_output=False,
                       output_level=1)
 
print("Spark NLP version: ", sparknlp.version())
print("Apache Spark version: ", spark.version)
 
from pyspark.ml import Pipeline
from sparknlp.annotator import UniversalSentenceEncoder
from sparknlp.common import *
from sparknlp.base import *
 
newstestdataset = spark.read \
    .option("header", True) \
    .option("inferSchema", True) \
    .csv("test_data.csv")

newstestdataset.show(10)

document = DocumentAssembler() \
    .setInputCol("description") \
    .setOutputCol("document")

use = UniversalSentenceEncoder.pretrained() \
    .setInputCols(["document"]) \
    .setOutputCol("sentence_embeddings")

pipeline = Pipeline(stages=[document, use])
testdataset = pipeline.fit(newstestdataset).transform(newstestdataset)

testdataset.show()

spark.stop()

“”“

### Spark NLP version and Apache Spark

Spark NLP version and Apache Spark
Spark NLP version: 5.5.3
spark.version; 3.2.1

### Type of Spark Application

_No response_

### Java Version

11.0.27" 2025-04-15

### Java Home Directory

_No response_

### Setup and installation

_No response_

### Operating System and Version

_No response_

### Link to your project (if available)

_No response_

### Additional Information

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

nable to execute HTTP request: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException #14584

Is there an existing issue for this?

Who can help?

What are you working on?

Current Behavior

Expected Behavior

Steps To Reproduce

Spark NLP version and Apache Spark

Type of Spark Application

Java Version

Java Home Directory

Setup and installation

Operating System and Version

Link to your project (if available)

Additional Information

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

nable to execute HTTP request: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException #14584

Description

Is there an existing issue for this?

Who can help?

What are you working on?

Current Behavior

Expected Behavior

Steps To Reproduce

Spark NLP version and Apache Spark

Type of Spark Application

Java Version

Java Home Directory

Setup and installation

Operating System and Version

Link to your project (if available)

Additional Information

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions