Package

com.swoop.spark

records

Permalink

package records

Spark Records package.

See also

Spark Records documentation

Linear Supertypes
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. records
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. trait AccumulatorMetrics extends Metrics

    Permalink

    A Metrics implementation using com.swoop.spark.accumulators.ByKeyAdditiveAccumulator.

  2. trait BuildContext[Ctx] extends HasJobContext[Ctx] with HasFeatures with HasDetails with ThrowsErrors

    Permalink

    Convenience trait that combines the main services exposed by RecordBuilder.

    Convenience trait that combines the main services exposed by RecordBuilder.

    Ctx

    the job context

    See also

    Build Context

  3. trait BuildsRecords[Rec <: Record[_, _]] extends Serializable

    Permalink

    Topmost trait for record builders.

  4. case class Cause(message: String, stack: Seq[StackElement] = Seq.empty) extends Product with Serializable

    Permalink

    A simple, serializable exception cause that keeps the exception message and the stack trace.

  5. abstract class CompositeRecordBuilder[BuildInput, RecordInput, Rec <: Record[_, _], Ctx <: JobContext] extends RecordBuilderLike[Rec, Ctx]

    Permalink

    Base class for record builders that produce multiple records for one input.

    Base class for record builders that produce multiple records for one input.

    BuildInput

    The input type for the builder.

    RecordInput

    The input type records are built from.

    Rec

    The record type.

    Ctx

    The job context type

    See also

    NestedRecordBuilder

    Composite Builders

  6. trait DriverContext extends Serializable with Metrics with Flight

    Permalink

    Shared behaviors for the build context that integrates with Spark context.

    Shared behaviors for the build context that integrates with Spark context.

    See also

    Build Context

  7. trait FeatureMask extends AnyRef

    Permalink

    Trait for types that have a feature mask.

  8. trait Features extends AnyRef

    Permalink

    Definitions for the meaning of bits in the features bit mask of com.swoop.spark.records.Record.

    Definitions for the meaning of bits in the features bit mask of com.swoop.spark.records.Record.

    The least significant 16 bits are reserved for framework use. The next 15 bits are available for application use. The most significant bit is reserved for framework use also.

  9. class FlatRecordEnvironment extends RecordEnvironment

    Permalink

    A flat record environment.

  10. trait Flight extends AnyRef

    Permalink

    Trait identifying flights: one or more related Spark jobs.

  11. trait HasDetails extends AnyRef

    Permalink

    Trait for a builder that aggregates details, which will become issues.

    Trait for a builder that aggregates details, which will become issues.

    Under the covers, details are Throwable instances: either exceptions thrown by the application code or ThrowableMessage instances that may be thrown and caught or simply constructed using one of the convenience methods exposed by this trait.

  12. trait HasFeatures extends AnyRef

    Permalink

    Trait for a builder that aggregates features.

  13. trait HasJobContext[Ctx] extends AnyRef

    Permalink

    Trait for a builder with a job context.

  14. case class Issue(category: Int, message: String, causes: Option[Seq[Cause]] = None, id: Option[Int] = None, details: Option[String] = None) extends Product with Serializable

    Permalink

    An issue is a container for diagnostic information including both simple fields and an optional sequence of causes.

    An issue is a container for diagnostic information including both simple fields and an optional sequence of causes.

    category

    The category of the issue. It matches the with the values coming from the different types of Issue$.ThrowableMessageType featureMask values.

    message

    The message related to the issue.

    causes

    The causes of this issue.

    id

    The issue ID.

    details

    Optional additional details for the issue. For root cause analysis purposes it is useful to to split messages we want to associate with records into a stable portion (the issue message) and a variable portion (the issue details). This enables stable grouping and counting.

    See also

    Issue$.messageWithDetails

  15. class IssuesDataFrame extends RootCauseAnalysisOps

    Permalink

    Provides root cause analysis operations at the issues level.

  16. trait JobContext extends Serializable with Metrics with Flight

    Permalink

    Shared behaviors for the build context that provides services to builders but does not work directly with Spark context APIs, allowing fast Spark-less testing.

    Shared behaviors for the build context that provides services to builders but does not work directly with Spark context APIs, allowing fast Spark-less testing.

    See also

    Build Context

  17. trait Metrics extends AnyRef

    Permalink

    Shared trait defining metrics collection services

  18. abstract class NestedRecordBuilder[Input, Data <: Product, Rec <: Record[Data, _], Ctx <: JobContext] extends RecordBuilder[Input, Data, Rec, Ctx]

    Permalink

    Base class for nested record builders of single records.

    Base class for nested record builders of single records.

    Input

    The source input

    Data

    The record data

    Rec

    The record

    Ctx

    The job context

  19. class PartitionedRecordEnvironment extends RecordEnvironment

    Permalink

    A partitioned error environment.

  20. implicit class RCARichDataFrame extends AnyRef

    Permalink

    Implicit operations on Spark Records dataframes.

    Implicit operations on Spark Records dataframes.

    Most operations can be applied equally at the record level or at the level of unwound issues.

    See also

    RootCauseAnalysis

    RootCauseAnalysisOps

  21. trait Record[A <: Product, Source] extends Product with Serializable

    Permalink

    A Spark Record is an envelope around data.

    A Spark Record is an envelope around data.

    A

    The data stored in the record

    Source

    The input type

    See also

    Spark Records documentation

  22. abstract class RecordBuilder[Input, Data <: Product, Rec <: Record[Data, _], Ctx <: JobContext] extends RecordBuilderLike[Rec, Ctx] with BuildContext[Ctx]

    Permalink

    Base class for record builders of single records.

    Base class for record builders of single records.

    Input

    The source input

    Data

    The record data

    Rec

    The record

    Ctx

    The job context

  23. abstract class RecordBuilderLike[Rec <: Record[_, _], Ctx <: JobContext] extends BuildsRecords[Rec] with HasJobContext[Ctx] with ThrowsErrors

    Permalink

    Base class for single and composite record builders.

    Base class for single and composite record builders.

    Rec

    The record

    Ctx

    The job context

  24. trait RecordEnvironment extends Serializable

    Permalink

    Base trait for the implicits that provide hints about how records are persisted.

    Base trait for the implicits that provide hints about how records are persisted.

    Annotations
    @implicitNotFound( ... )
  25. class RecordsDataFrame extends RootCauseAnalysisOps

    Permalink

    Provides root cause analysis operations at the record level.

  26. implicit class RecordsRichDataset[A <: Product, Rec] extends Serializable

    Permalink

    Implicit operations on Spark Records datasets.

    Implicit operations on Spark Records datasets.

    See also

    RootCauseAnalysisOps

  27. implicit final class RecordsRichOption[A] extends AnyVal

    Permalink

    Extends Option with record building sugar.

  28. implicit final class RecordsRichTry[A] extends AnyVal

    Permalink

    Extends scala.util.Try with record building sugar.

  29. implicit final class RecordsRichTryOption[A] extends AnyVal

    Permalink

    Extends scala.util.Try with optional values with record building sugar.

  30. class RootCauseAnalysis extends RootCauseAnalysisOps

    Permalink

    Provides root cause analysis operations at any level by automatically dispatching to RecordsDataFrame or IssuesDataFrame based on the schema of the provided dataframe.

  31. trait RootCauseAnalysisOps extends Serializable

    Permalink

    Base trait for root cause analysis operations.

  32. class SimpleDriverContext extends DriverContext with AccumulatorMetrics

    Permalink

    Basic implementation of DriverContext that uses AccumulatorMetrics and includes a convenience factory for creating job contexts.

    Basic implementation of DriverContext that uses AccumulatorMetrics and includes a convenience factory for creating job contexts.

    See also

    Build Context

  33. class SimpleJobContext extends JobContext with AccumulatorMetrics

    Permalink

    Basic implementation of JobContext that uses AccumulatorMetrics.

    Basic implementation of JobContext that uses AccumulatorMetrics.

    Note

    Spark accumulators can be used locally without registration.

    See also

    Build Context

  34. case class StackElement(className: String, methodName: String, fileName: String, lineNumber: Int) extends Product with Serializable

    Permalink

    A simple, serializable adapter for StackTraceElement.

  35. case class ThrowableMessage(kind: ThrowableMessageType, msg: String, msgId: Int = 0, cause: Throwable = null) extends Throwable with FeatureMask with Product with Serializable

    Permalink

    Base class for throwable diagnostic messages with IDs for quick big data processing.

    Base class for throwable diagnostic messages with IDs for quick big data processing.

    kind

    the type of message (error, warning, info or debug)

    msg

    the message

    msgId

    the optional message ID

    cause

    the cause, if the message is being thrown as a result of an exception

  36. trait ThrowsErrors extends AnyRef

    Permalink

    Convenience trait for throwing ThrowableMessage.

Value Members

  1. object FeatureMask

    Permalink
  2. object Features extends Features

    Permalink
  3. object FlatRecordEnvironment extends Serializable

    Permalink
  4. object Issue extends ThrowsErrors with Serializable

    Permalink
  5. object PartitionedRecordEnvironment extends Serializable

    Permalink
  6. object Record extends Serializable

    Permalink
  7. object RootCauseAnalysis extends Serializable

    Permalink
  8. object SimpleDriverContext extends Serializable

    Permalink
  9. object SimpleJobContext extends Serializable

    Permalink
  10. implicit def optionalIssueSeq(issues: Seq[Issue]): Option[Seq[Issue]]

    Permalink

    Convenience conversion to make issue saving during record creation easier.

Inherited from AnyRef

Inherited from Any

Ungrouped