Design for d6tflow framework

We can split our tasks to the following Task of `d6tflow`  framework
Task1 -> open Java file with correct encoding
Task2 -> remove all spaces and comments in it and save to another file
Task3 -> open file, find all method which can be inlined. Save target, extracted, full_ast, text_file, filename, row_csv from Task2
    Task4 -> Task3 get target, extracted and filter it. Save target, extracted, full_ast, text_file, filename, row_csv from Task3
    Task5 -> get result from Task3 and filter limited cases. Save target, extracted, full_ast, text_file, filename, row_csv from Task4
    Task6 -> Inline Method, save file, row_csv
Task 7 -> save row_csv to global DataFrame

Possible problems:
1) We have to save our `preprocessed` files to external memory, since we will have lots of files and it won't have enough memory to keep them in cache. Also, we have to keep them also in external memory since, it's our dataset which will be validated.
Seems, it cannot be done due to https://github.com/d6t/d6tflow/issues/6
2) We need to save different types of objects: ast tree, text. Seems, it's difficult:
https://github.com/d6t/d6tflow/issues/26


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Design for d6tflow framework #123

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Design for d6tflow framework #123

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions