Commit bff5fc4

Merge pull request #152 from datacarpentry/annajiat-patch-1
Update 11-joins.md
2 parents 3d4a5c1 + d4b2b43 commit bff5fc4

1 file changed, +3 -3 lines changed

_episodes/11-joins.md

Lines changed: 3 additions & 3 deletions
@@ -65,7 +65,7 @@ print(df_SN7577i_b)
 ~~~
 {: output}

-The `concat()` function appends the rows from the two Dataframes to create the df_all_rows Dataframe. When you list this out you can see that all of the data rows are there, however there is a problem with the `index`.
+The `concat()` function appends the rows from the two Dataframes to create the df_all_rows Dataframe. When you list this out you can see that all of the data rows are there, however, there is a problem with the `index`.

 ~~~
 df_all_rows = pd.concat([df_SN7577i_a, df_SN7577i_b])
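
Note on the hunk above: the changed paragraph mentions a problem with the `index` without showing it in this diff. The sketch below is one reading of that problem (repeated index labels) and one common fix, `ignore_index=True`. The frames `df_a` and `df_b` and their values are invented stand-ins for `df_SN7577i_a` and `df_SN7577i_b`, not data from the lesson.

```python
import pandas as pd

# Hypothetical stand-ins for df_SN7577i_a and df_SN7577i_b (values are made up)
df_a = pd.DataFrame({"Id": [1, 2, 3], "Q1": [1, 3, 1]})
df_b = pd.DataFrame({"Id": [4, 5, 6], "Q1": [2, 1, 3]})

# By default concat keeps each frame's own index labels, so 0, 1, 2 repeat
df_dup = pd.concat([df_a, df_b])
print(df_dup.index.tolist())    # [0, 1, 2, 0, 1, 2]

# ignore_index=True discards the old labels and numbers the rows 0..n-1
df_reset = pd.concat([df_a, df_b], ignore_index=True)
print(df_reset.index.tolist())  # [0, 1, 2, 3, 4, 5]
```

With repeated labels, a lookup such as `df_dup.loc[0]` returns two rows rather than one, which is usually not what is intended.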
@@ -106,7 +106,7 @@ df_all_cols
 ~~~
 {: .language-python}

-We use the `axis=1` parameter to indicate that it is the columns that need to be joined together. Notice that the `Id` column appears twice, because it was a column in each dataset. This is not particularly desirable, but also not necessarily a problem. However there are better ways of combining columns from two Dataframes which avoid this problem.
+We use the `axis=1` parameter to indicate that it is the columns that need to be joined together. Notice that the `Id` column appears twice, because it was a column in each dataset. This is not particularly desirable, but also not necessarily a problem. However, there are better ways of combining columns from two Dataframes which avoid this problem.

 ### Scenario 3 - Using merge to join columns
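
Note on the hunk above: the duplicated `Id` column described in the `axis=1` paragraph can be reproduced with a minimal sketch. The frame names `df_left` and `df_right` and their values are hypothetical, chosen only so that both frames share an `Id` column.

```python
import pandas as pd

# Hypothetical frames that both carry an Id column (values are made up)
df_left = pd.DataFrame({"Id": [1, 2, 3], "Q1": [1, 3, 1]})
df_right = pd.DataFrame({"Id": [1, 2, 3], "Q2": [2, 2, 1]})

# axis=1 places the frames side by side; concat does not deduplicate column names
df_all_cols = pd.concat([df_left, df_right], axis=1)
print(df_all_cols.columns.tolist())  # ['Id', 'Q1', 'Id', 'Q2']
```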

@@ -122,7 +122,7 @@ df_cd
 ~~~
 {: .language-python}

-In fact if there is only one column with the same name in each Dataframe, it will be assumed to be the one you want to join on. In this example the `Id` column
+In fact, if there is only one column with the same name in each Dataframe, it will be assumed to be the one you want to join on. In this example the `Id` column

 Leaving the join column to default in this way is not best practice. It is better to explicitly name the column using the `on` parameter.
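
Note on the hunk above: the difference between the default join column and an explicit `on` can be sketched as follows. The frames `df_c` and `df_d` here are small invented examples sharing only an `Id` column; they are not the lesson's data.

```python
import pandas as pd

# Hypothetical frames whose only common column is Id (values are made up)
df_c = pd.DataFrame({"Id": [1, 2, 3], "Q1": [1, 3, 1]})
df_d = pd.DataFrame({"Id": [1, 2, 3], "Q2": [2, 2, 1]})

# With no `on`, merge joins on the columns the two frames have in common (here: Id)
df_default = pd.merge(df_c, df_d)

# Naming the key makes the intent explicit and stays stable if more shared columns appear later
df_explicit = pd.merge(df_c, df_d, on="Id")

print(df_explicit.columns.tolist())    # ['Id', 'Q1', 'Q2']
print(df_default.equals(df_explicit))  # True
```

Unlike `concat(..., axis=1)`, the merged result carries a single `Id` column.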
