2012 activities
Activity |
Start |
End |
Planning and Preparation | Jan_2012 | Mar_2012 |
Theory of Information | Apr_2012 | Jun_2012 |
Service Research | May_2012 | Aug_2012 |
Social Networks indicators | May_2012 | Jun_2012 |
Activity |
Start |
Up to |
Understanding the problem and data knowledge | Apr_2012 | Jul_2012 |
Assessing tools for dealing and importing from XML | May_2012 | Jul_2012 |
Grouping Data | Aug_2012 | Aug_2012 |
Merging – Who talked to who | Aug_2012 | Sep_2012 |
Alingnment – Checking Data types and correction | Aug_2012 | Sep_2012 |
Understanding XML retrieaval | Sep_2012 | Oct_2012 |
Created Gen1.str – Uses XML Node tor retrieve data; Treats Generalisation Concepts for 1 group; Gets all concepts together function count_non_nulls and concatenation >< ; some trial with Transpose node | Oct_2012 | Oct_2012 |
Created Gen_october.str (8 versions) – Uses XML Node tor retrieve data; Treats Generalisation Concepts for all groups; Extends Gen1.str including all groups; Uses: Append Node; Sort Node; Agreggation Node; Derive Node using substring(locchar()) functions let extract day, month, year, hour, minutes, day of week; Generates some graphic as well | Oct_2012 | Oct_2012 |
OUTPUT FROM STREAM Gen_october_v8.str are: Date Processing – stream separates (day,month, year) and creates a one field date and other for the week; A Table with Qty of sutdents post for each group indiferent orders (TABLE 1 and TABLE 2);Total of students posts per group (TABLE 3); The words and terms that students had used that had been tagged for each Concept (TABLE 4) with a GRAPHIC with posts per group; The total of posts for each week (TABLE 5) and the GRAPHIC for posts per week. | Oct_2012 | Oct_2012 |
Used EXCEL to Retrieve from XML file fields with terms for each concept (problem: concepts fields have been multiplied and should be agregated); Transpose Node Trial and EXCEL Export Node. OUTPUT (EXCEL Transpose1.xml and Transpose2.xml files; The EXCEL file count_tags_group4.xls retrieved XML file as a TXT and manually kept only lines that contained the tags, so they could be counted!) | Nov_2012 | Nov_2012 |
02_Nov_2012 a deep check have been done to data retrieved by XML Node. An EXCEL file had been generated. STREAM1 retrieves data from the GROUP 4 XML file using XML node, for all concepts. It had been considered all concepts, but for the analysis the concept that was more frequent – S_data_values_numeric – had been choosen. Tha TABLE generated by STREAM1 shows for this concept 93 posts as the EXCEL file count_tags_group4.xls shows 102 tags for that concept.The conclusion was that some Tags were not retrieved. The final EXCEL file created in 09_Nov_2012 with Transposed data was named transposed4_ONLY_NUMERIC_GRP4.xls. It has the number of not null tags for The tag <S_data_values_numeric>. Using this file, all messagens that should have some data and were not retrieved by XML Node have been marked. | Nov_2012 | Nov_2012 |
Activity |
Start |
End |
The STREAM group2_all.str shows how to remove the sufix automatically generated from the XML Source NODE so that the frequence of concepts for the same tag, i.e. S_DATA_DB, S_DATA_DB_1, S_DATA_DB_2 could be converted all to S_DATA_DB. The stream worked with Group 2 XML file | Nov_2012 | Nov_2012 |
09_Nov_2012 group4_all_nov.str has 2 parts. For the first part the STREAM reads the group 4 XML file and it creates an EXCEL file – Transposed4.xls – with the names of each concept and the occurrences, for each message.It creates a string field S_nostr from the field no. The concepts are changed to 1 sothat it will be aggregated. But, first, the names of the fields must be the same. To get the names of the fields, the table must be transposed. The second part of the STREAM reads that EXCEL file – Transposed4.xls – and it generates an output with the number of occurrences for each concept. | Nov_2012 | Nov_2012 |
03_nov_2012, the STREAM total_concepts_for_groups.str uses File Source (instead of the XML Node). The stream retrieves tags that start with <S_ and end with >. The sequence after the string > is not considerated. With the files containing only tags, the tags can be counted for each group. The process generatesThe graphs show the number of occurrences for tags that occurred more than 15 times, excepting the group 10 – that have less then 15 tags. Group 4 is the first, since it is the one used within the article. | Nov_2012 | Nov_2012 |
From 13 to 27_Nov the Stream Get_Date have been modified. It is an evolution of the first STREAM Gen_october_v8.str and now it included accumulation of frequency and some graphics. The final version was recorded 29_Nov_2012 and it is named getting_dateV12.str. This is the FIRST MAIN FILE and has a special webpage for documentation. | Nov_2012 | Nov_2012 |
From 27 to 28_Nov_2012 the former file Gen_october.str, created in October had some more nodes included and generated the final version that has all XML reading. This is the SECOND MAIN FILE and has a special webpage for documentation. | Nov_2012 | Nov_2012 |
Writing – With both MAIN FILES capabilities to generate graphs and tables to support the analysis, the article had been modified in the end of November, 2012. | Nov_2012 | Nov_2012 |
Documentation – Starting documentation of all process. | Dec_2012 | Dec_2012 |