-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Define standard necessary column names for input data. #24
Comments
Hi Anuj, I believe these columns are not part of the input file: However, we can use them as standardized column names for the output file. |
Hi @anujsinha3, these are the following column names:
For output columns, please take a look at the output column names below: |
@gracejia513 Please confirm once. |
@anujsinha3 did you have a standardized column for datetime? Is it UNIX_START_T? |
Column names have been standardized in the following format, i.e. The column names are insensitive to capital or small letters but do require '_' where mentioned. A few Examples: 'orig_lat', 'orig_long', 'unix_start_t', 'user_id' |
Currently, each column in the CSV file is accessed by an integer index. This has the following limitations:
We plan to use pandas data frames going forward, for which we need to standardize the column names that will be part of the input CSV file.
Existing column names: (Confirm if these column names are standard ones, or if any change if required)
"unix_start_t",
"user_ID",
"orig_lat",
"orig_long",
"orig_unc",
"stay_lat",
"stay_long",
"stay_unc",
"stay_dur",
"stay_ind",
"human_start_t"
The text was updated successfully, but these errors were encountered: