2012 activities

Posted by ilan | Wednesday 9 April 2014 9:35 pm

 

Activity Start End
Planning and Preparation Jan_2012 Mar_2012
Theory of Information Apr_2012 Jun_2012
Service Research May_2012 Aug_2012
Social Networks indicators May_2012 Jun_2012

Activity Start Up to
Understanding the problem and data knowledge Apr_2012 Jul_2012
Assessing tools for handling and importing XML May_2012 Jul_2012
Grouping Data Aug_2012 Aug_2012
Merging – Who talked to whom Aug_2012 Sep_2012
Alignment – Checking data types and correction Aug_2012 Sep_2012
Understanding XML retrieval Sep_2012 Oct_2012
Created Gen1.str – Uses the XML Node to retrieve data; treats Generalisation Concepts for 1 group; combines all concepts using the count_non_nulls function and the >< concatenation operator; some trials with the Transpose Node Oct_2012 Oct_2012
Created Gen_october.str (8 versions) – Uses the XML Node to retrieve data; treats Generalisation Concepts for all groups; extends Gen1.str to include all groups; uses: Append Node; Sort Node; Aggregation Node; Derive Node with substring(locchar()) functions to extract day, month, year, hour, minutes and day of week; generates some graphics as well Oct_2012 Oct_2012
OUTPUT FROM STREAM Gen_october_v8.str: Date processing – the stream separates day, month and year and creates one field for the date and another for the week; a table with the quantity of student posts for each group, in different orders (TABLE 1 and TABLE 2); total student posts per group (TABLE 3); the words and terms used by students that were tagged for each concept (TABLE 4), with a GRAPHIC of posts per group; the total posts for each week (TABLE 5) and the GRAPHIC of posts per week. Oct_2012 Oct_2012
Used EXCEL to retrieve from the XML file the fields with terms for each concept (problem: concept fields were multiplied and should be aggregated); Transpose Node trial and EXCEL Export Node. OUTPUT: EXCEL files Transpose1.xml and Transpose2.xml; for the EXCEL file count_tags_group4.xls, the XML file was read as TXT and only the lines containing the tags were kept manually, so they could be counted. Nov_2012 Nov_2012
02_Nov_2012: a thorough check was made of the data retrieved by the XML Node, and an EXCEL file was generated. STREAM1 retrieves data from the GROUP 4 XML file using the XML Node, for all concepts. All concepts were considered, but the most frequent one, S_data_values_numeric, was chosen for the analysis. The TABLE generated by STREAM1 shows 93 posts for this concept, whereas the EXCEL file count_tags_group4.xls shows 102 tags for it, a difference of 9. The conclusion was that some tags were not retrieved. The final EXCEL file, created on 09_Nov_2012 with transposed data, was named transposed4_ONLY_NUMERIC_GRP4.xls. It holds the number of non-null tags for the tag <S_data_values_numeric>. Using this file, all messages that should have had some data but were not retrieved by the XML Node were marked. Nov_2012 Nov_2012
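
The date processing described for Gen_october.str (a Derive Node that locates separator characters and cuts substrings between them) can be illustrated outside the stream tool. The following is only a minimal Python sketch of the same idea, not the stream itself. The timestamp layout DD/MM/YYYY HH:MM and the names find_char and split_timestamp are assumptions made for illustration; the real format of the XML date field is not stated here.

```python
from datetime import date

# Fixed English names so the result does not depend on the system locale.
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def find_char(text, char, start=0):
    """Rough analogue of locchar: index of `char` at or after `start`, or -1."""
    return text.find(char, start)


def split_timestamp(stamp):
    """Split an assumed 'DD/MM/YYYY HH:MM' string by locating its separators."""
    first_slash = find_char(stamp, "/")
    second_slash = find_char(stamp, "/", first_slash + 1)
    space = find_char(stamp, " ", second_slash + 1)
    colon = find_char(stamp, ":", space + 1)
    if min(first_slash, second_slash, space, colon) < 0:
        raise ValueError(f"unexpected timestamp layout: {stamp!r}")

    # Cut the substrings between the separators, as the Derive Node does.
    day = int(stamp[:first_slash])
    month = int(stamp[first_slash + 1:second_slash])
    year = int(stamp[second_slash + 1:space])
    hour = int(stamp[space + 1:colon])
    minute = int(stamp[colon + 1:])

    as_date = date(year, month, day)
    return {
        "day": day,
        "month": month,
        "year": year,
        "hour": hour,
        "minute": minute,
        "date": as_date,                      # single combined date field
        "week": as_date.isocalendar()[1],     # ISO week number
        "day_of_week": WEEKDAYS[as_date.weekday()],
    }


if __name__ == "__main__":
    print(split_timestamp("02/11/2012 14:35"))
```

Grouping posts by the "week" field of each record would then give the posts-per-week totals, assuming one timestamp per post.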

Activity Start End
The STREAM group2_all.str shows how to remove the suffix automatically generated by the XML Source Node, so that the frequencies of concepts for the same tag, e.g. S_DATA_DB, S_DATA_DB_1, S_DATA_DB_2, can all be converted to S_DATA_DB. The stream worked with the Group 2 XML file. Nov_2012 Nov_2012
09_Nov_2012: group4_all_nov.str has 2 parts. In the first part, the STREAM reads the group 4 XML file and creates an EXCEL file, Transposed4.xls, with the name of each concept and its occurrences for each message. It creates a string field S_nostr from the field no. The concept values are changed to 1 so that they can be aggregated. But first, the field names must be the same; to get the field names, the table must be transposed. The second part of the STREAM reads that EXCEL file, Transposed4.xls, and generates an output with the number of occurrences for each concept. Nov_2012 Nov_2012
03_Nov_2012: the STREAM total_concepts_for_groups.str uses the File Source (instead of the XML Node). The stream retrieves tags that start with <S_ and end with >. Whatever follows the > is not considered. With the files containing only tags, the tags can be counted for each group. The process generates graphs that show the number of occurrences for tags that occurred more than 15 times, except for group 10, which has fewer than 15 tags. Group 4 comes first, since it is the one used in the article. Nov_2012 Nov_2012
From 13 to 27_Nov_2012 the stream Get_Date was modified. It is an evolution of the first STREAM, Gen_october_v8.str, and now includes accumulated frequencies and some graphics. The final version was recorded on 29_Nov_2012 and is named getting_dateV12.str. This is the FIRST MAIN FILE and has a special webpage for documentation. Nov_2012 Nov_2012
From 27 to 28_Nov_2012 the earlier file Gen_october.str, created in October, had more nodes added, producing the final version that performs all the XML reading. This is the SECOND MAIN FILE and has a special webpage for documentation. Nov_2012 Nov_2012
Writing – With both MAIN FILES able to generate graphs and tables to support the analysis, the article was revised at the end of November 2012. Nov_2012 Nov_2012
Documentation – Started documenting the whole process. Dec_2012 Dec_2012
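
Two of the November steps above, reading tags that start with <S_ directly from the file as text and folding suffixed names such as S_DATA_DB_1 back into S_DATA_DB, can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a transcription of group2_all.str or total_concepts_for_groups.str: the function names are hypothetical, the sample text is made up, and the sketch assumes that no real concept name ends in an underscore followed by digits.

```python
import re
from collections import Counter

# Opening tags only: starts with "<S_", ends at the first ">".
# Closing tags start with "</" and therefore do not match.
TAG_PATTERN = re.compile(r"<(S_[^>]*)>")

# Numeric suffix such as "_1" or "_2" (assumed to be added by the XML import).
SUFFIX_PATTERN = re.compile(r"_\d+$")


def extract_tags(text):
    """Return the names of all <S_...> opening tags found in raw text."""
    return TAG_PATTERN.findall(text)


def strip_suffix(name):
    """Map S_DATA_DB_1, S_DATA_DB_2, ... back to S_DATA_DB."""
    return SUFFIX_PATTERN.sub("", name)


def count_concepts(names):
    """Count occurrences per concept after removing generated suffixes."""
    return Counter(strip_suffix(name) for name in names)


def frequent(counts, threshold=15):
    """Keep only the concepts that occur more than `threshold` times."""
    return {name: n for name, n in counts.items() if n > threshold}


if __name__ == "__main__":
    # Made-up sample, not taken from the group files.
    sample = (
        "<S_DATA_DB>mysql</S_DATA_DB> some text "
        "<S_data_values_numeric>42</S_data_values_numeric>"
    )
    print(extract_tags(sample))
    print(count_concepts(["S_DATA_DB", "S_DATA_DB_1", "S_DATA_DB_2"]))
```

Counting tags from the raw text in this way is independent of the XML Node, so it can also serve as a cross-check on the number of tags the node retrieves.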

 
