Thursday, August 2, 2018

Talend and Apache Spark: Debugging and Logging Best Practices

So far, our journey on using Apache Spark with Talend has been a fun and exciting one. The first three posts on my series provided an overview of how Talend works with Apache Spark, some similarities between Talend and Spark Submit, the configuration options available for Spark jobs in Talend and how to tune Spark jobs for performance. If you haven't already read them you should do so before getting started here. Start with: "Talend & Apache Spark: A Technical Primer"; "Talend vs. Spark Submit Configuration: What's the Difference?"; "Apache Spark and Talend: Performance and Tuning."

To finish this series, we're going to talking about logging and debugging. When starting your journey with using Talend and Apache Spark you may have run into the error like below printed out in your console log:



from DZone.com Feed https://ift.tt/2OAdwJ2

No comments:

Post a Comment