data-engineering
Here are 530 public repositories matching this topic...
Description
From the community Slack: https://prefect-community.slack.com/archives/CL09KU1K7/p1599551263372600
From that Slack comment:
I am trying to run a DbtShellTask with a dynamic set of dbt_kwargs that are defined at run time. The task works when I define the dbt_kwargs in the task initialization, but not when they are provided at runtime.
Looks like the dbt_kwargs needs an em
Describe the bug
This is basically one of the issues I called out in #1855:
When I run datasource new and exit the process at any point (e.g. ctrl+c), I still get a block for the credentials in config_variables.yml. However great_expectations.yml doesn't have the datasource entry. I would expect any kind of failure in the datasource creation process to not leave any artifacts.
**To Re
Brief Description
Following up on #703, this issue seeks to introduce the ability to test whether, in each column of a data frame, the change between every two consecutive rows is within a user-defined range of values.
I would like to propose..
Example API
import pandas as pd
import numpy as np
# Create a random data frame
df = pd.DataFrame(data=[["2015-01-01 00:00:00", -0.76, 2, 2, -
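The check described in this issue could be sketched roughly as follows. This is a minimal illustration, not the proposed API: the helper name `changes_within_range`, its `lower`/`upper` parameters, and the sample values are all hypothetical, and it assumes "change between rows" means the difference between consecutive rows of numeric columns.

```python
import pandas as pd


def changes_within_range(df: pd.DataFrame, lower: float, upper: float) -> pd.Series:
    """Report, per numeric column, whether every row-to-row change lies in [lower, upper].

    Hypothetical helper for illustration only; not part of any library API.
    """
    # Difference between each row and the one before it.
    # The first row has no predecessor, so it is dropped.
    deltas = df.select_dtypes("number").diff().iloc[1:]
    # A column passes only if all of its consecutive differences are in range.
    return ((deltas >= lower) & (deltas <= upper)).all()


# Made-up sample values, unrelated to the truncated data frame above.
sample = pd.DataFrame({"a": [1.0, 1.5, 2.0], "b": [0.0, 5.0, 5.5]})
result = changes_within_range(sample, lower=-1.0, upper=1.0)
print(result)
```

In this sketch a missing value in the middle of a column makes that column fail, because comparisons against NaN evaluate to False; whether that is the desired behavior is a design choice the issue would need to settle.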
SubjectAreaRESTServicesInstance hard-codes the default page size as 0, which is not correct:
public static final String PAGE_SIZE_DEFAULT_VALUE = "0";
It should be changed to
public static final String PAGE_SIZE_DEFAULT_VALUE = "1000";
so that it is consistent with the OMAGServerConfig default:
private static final int defaultMaxPageSize = 1000;
Is there any way to write the Historical and Online feature stores directly to HDFS?
Summary
I would like to write the Historical and Online feature stores to two different HDFS paths, using the Parquet format.
I've tried using the S3Config class to do it, but it does not work.
This issue tracks Superset's migration from JavaScript to TypeScript (as started in SIP-36). If you'd like to help with the migration, feel free to take an unchecked directory and convert the files within the immediate directory from JavaScript/JSX to TypeScript/TSX. #9162 and #9180 provide some tips for performing the migration. Once com