py4j.protocol.Py4JJavaError with PySpark

























I get the error below when I run the following PySpark program. My environment:

  • OS: Windows 10

  • Java: version 8

  • Spark: version 2.4.0

  • Python: version 3.6
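
When reporting an error like this, it can help to gather the versions listed above programmatically so they are exact. The following is a minimal sketch, not part of the original question: the helper name `describe_environment` is hypothetical, and it assumes only that PySpark, if installed, exposes `pyspark.__version__`.

```python
import os
import platform
import sys


def describe_environment():
    """Collect the version details that matter when reporting a PySpark error."""
    try:
        import pyspark  # optional: only importable if PySpark is installed
        spark_version = pyspark.__version__
    except ImportError:
        spark_version = None
    return {
        "os": platform.platform(),
        "python": platform.python_version(),
        "python_executable": sys.executable,
        "pyspark": spark_version,
        # Environment variables commonly involved in a local Spark setup
        "JAVA_HOME": os.environ.get("JAVA_HOME"),
        "SPARK_HOME": os.environ.get("SPARK_HOME"),
        "PYSPARK_PYTHON": os.environ.get("PYSPARK_PYTHON"),
    }


if __name__ == "__main__":
    for name, value in describe_environment().items():
        print(f"{name}: {value}")
```

Values that come back as `None` are simply not set or not installed in the current session.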



CODE:



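from pyspark.context import SparkContext

# Reuse the running SparkContext, or create one if none exists
sc = SparkContext.getOrCreate()
# Reference the text file as an RDD of lines (evaluated lazily)
textFile = sc.textFile(r"file.txt")
# count() is an action; calling it starts the job that fails
textFile.count()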


ERROR:



 ---------------------------------------------------------------------------
Py4JJavaError Traceback (most recent call last)
<ipython-input-7-99998e5c7b17> in <module>()
----> 1 textFile.count()
Py4JJavaError: An error occurred while calling z:org.apache.spark.api.python.PythonRDD.collectAndServe.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 1.0 failed 1 times, most recent failure: Lost task 0.0 in stage 1.0 (TID 4, localhost, executor driver): org.apache.spark.SparkException: Python worker failed to connect back.
at org.apache.spark.api.python.PythonWorkerFactory.createSimpleWorker(PythonWorkerFactory.scala:170)...


Many people report the same problem and say they solved it by switching to Java 8, but I am already using Java 8 and still get the error.



Any help appreciated.



Thanks.
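
Since the trace says the Python worker failed to connect back, one thing that may be worth checking is whether Spark starts its workers with the same Python interpreter as the driver. This is an assumption, not something confirmed anywhere in this thread. The sketch below sets the `PYSPARK_PYTHON` environment variable, which PySpark uses to choose the worker interpreter, to the interpreter running the current session. The helper name `pin_worker_python` is hypothetical.

```python
import os
import sys


def pin_worker_python(executable=None):
    """Point PySpark workers at a specific Python interpreter.

    Call this before the SparkContext is created; a context that already
    exists keeps the interpreter it was started with.
    """
    executable = executable or sys.executable
    os.environ["PYSPARK_PYTHON"] = executable
    return executable


pin_worker_python()

# Afterwards, create the context as in the question:
# from pyspark.context import SparkContext
# sc = SparkContext.getOrCreate()
# sc.textFile(r"file.txt").count()
```

If a context is already running in the notebook, `getOrCreate()` returns it unchanged, so it would need to be stopped with `sc.stop()` before the setting can have any effect. Whether this resolves the error on a given machine is not guaranteed.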
































  • Any updates on this? I am facing this error too, with Java 8 update 202 and Python 3.7 on Windows 10.

    – Indrajit
    Feb 8 at 15:00











  • I was able to solve this by going back to Spark 2.3, as described in this post: stackoverflow.com/questions/53252181/…

    – Indrajit
    Feb 8 at 15:17
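
After a downgrade like the one described in the comment above, it may be useful to confirm which Spark line is actually installed. The following is a small sketch under stated assumptions: the helper names `parse_version` and `is_spark_2_3` are hypothetical, and the version strings in the example calls are illustrative inputs, not values taken from any particular installation.

```python
import re


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component, e.g. "dev0"
        parts.append(int(match.group()))
    return tuple(parts)


def is_spark_2_3(version_text):
    """True when the version belongs to the 2.3 line mentioned in the comment."""
    return parse_version(version_text)[:2] == (2, 3)


print(is_spark_2_3("2.4.0"))  # the version from the question: prints False
print(is_spark_2_3("2.3.2"))  # an illustrative 2.3.x string: prints True
```

With PySpark installed, the same check can be run on `pyspark.__version__`.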















python-3.x pyspark






asked Nov 16 '18 at 6:40 by Nusrath











