Coevolving Innovations

… in Business Organizations and Information Technologies

Currently Viewing Posts Tagged quantitative methods

Learning data science, hands-on

For the Quantitative Methodologies for Design Research (定量研究方法) course for Ph.D. students at Tongji University in spring 2017, Susu Nousala invited me to join the team of instructors in collaborative education in Shanghai.  Experts were brought in during the course to guide the graduate students.

My participation in the course over two days had three parts:  (a) preparing a lecture outline; (b) orienting the students; and (c) equipping the students with tools.

(A) Preparing a lecture outline

While I’m comfortable with the mathematics underlying statistical analysis, I have a lot of practical experience of working with business executives who aren’t.  Thus, my approach to working with data relies a lot on presentation graphics to defog the phenomena.  While the label of data science began to rise circa 2012, I’ve had the benefit of practical experience that predates that.

Today's APL
AGSS: A Graphical Statistical System (1994)

In my first professional assignment in IBM Canada in 1985, data science would have been called econometrics.  My work included forecasting country sales, based on price-performance indexes (from the mainframe, midrange and personal computer product divisions) and economic outlooks from Statistics Canada.  Two years before the Macintosh II would bring color to personal computing, I was an early adopter of GRAFSTAT: “An APL system for interactive scientific-engineering graphics and data analysis” developed at IBM Research.  This would eventually become an IBM program product by called AGSS (A Graphical Statistical System) by 1994.

Metaphor Computer Systems workstation
Metaphor Computer Systems workstation

In 1988, I had an assignment where data science would have been called marketing science.  I was sent to California to work in the IBM partnership with Metaphor Computer Systems. This was a Xerox PARC spin-off with a vision that predated the first web page on the World Wide Web by a few years.  These activities led me into the TIMS Marketing Science Conference in 1990, cofounding the Canadian Centre for Marketing Information Technologies (C2MIT) and contributing chapters to The Marketing Information Revolution published in 1994.

This journey led me to appreciate the selection and use of computer-based tools for quantitative analysis.  Today, the two leading platforms in “Data Science 101” are Python (a general purpose language with statistical libraries), and the R Project for Statistical Computing (a specialized package for data analysis and visualization).  Both are open source projects, and free to download and use on personal computers.  I tried both.  R is a higher level programming language more similar to the APL programming language that gets work done more quickly.  For statistical work, I recommend R over Python (although APL is a theoretically better implementation).

Intro to R Programming, Big Data University
Intro to R Programming, Big Data University, Feb. 22, 2017

Since I live in Toronto, I attended the February session of Data Science with R – Bootcamp in person, at Ryerson University.  There, I was watched Polong Lin leading a class through R using the Jupyter notebook, both in (i) an interactive version, and (ii) a printable version.  Students had the choice to either follow Polong (i) actively, in a step-by-step execution in the Cognitive Class Virtual Lab (formerly called the Data Scientist Workbench) with a cloud-based R session through their web browsers, or (ii) passively, reading the static printable content.

For the Quantitative Methodologies for Design Research (定量研究方法) course for Ph.D. students at Tongji University in spring 2017, Susu Nousala invited me to join the team of instructors in collaborative education in Shanghai.  Experts were brought in during the course to guide the graduate students.

My participation in the course over two days had three parts:  (a) preparing a lecture outline; (b) orienting the students; and (c) equipping the students with tools.

(A) Preparing a lecture outline

While I’m comfortable with the mathematics underlying statistical analysis, I have a lot of practical experience of working with business executives who aren’t.  Thus, my approach to working with data relies a lot on presentation graphics to defog the phenomena.  While the label of data science began to rise circa 2012, I’ve had the benefit of practical experience that predates that.

Today's APL
AGSS: A Graphical Statistical System (1994)

In my first professional assignment in IBM Canada in 1985, data science would have been called econometrics.  My work included forecasting country sales, based on price-performance indexes (from the mainframe, midrange and personal computer product divisions) and economic outlooks from Statistics Canada.  Two years before the Macintosh II would bring color to personal computing, I was an early adopter of GRAFSTAT: “An APL system for interactive scientific-engineering graphics and data analysis” developed at IBM Research.  This would eventually become an IBM program product by called AGSS (A Graphical Statistical System) by 1994.

Metaphor Computer Systems workstation
Metaphor Computer Systems workstation

In 1988, I had an assignment where data science would have been called marketing science.  I was sent to California to work in the IBM partnership with Metaphor Computer Systems. This was a Xerox PARC spin-off with a vision that predated the first web page on the World Wide Web by a few years.  These activities led me into the TIMS Marketing Science Conference in 1990, cofounding the Canadian Centre for Marketing Information Technologies (C2MIT) and contributing chapters to The Marketing Information Revolution published in 1994.

This journey led me to appreciate the selection and use of computer-based tools for quantitative analysis.  Today, the two leading platforms in “Data Science 101” are Python (a general purpose language with statistical libraries), and the R Project for Statistical Computing (a specialized package for data analysis and visualization).  Both are open source projects, and free to download and use on personal computers.  I tried both.  R is a higher level programming language more similar to the APL programming language that gets work done more quickly.  For statistical work, I recommend R over Python (although APL is a theoretically better implementation).

Intro to R Programming, Big Data University
Intro to R Programming, Big Data University, Feb. 22, 2017

Since I live in Toronto, I attended the February session of Data Science with R – Bootcamp in person, at Ryerson University.  There, I was watched Polong Lin leading a class through R using the Jupyter notebook, both in (i) an interactive version, and (ii) a printable version.  Students had the choice to either follow Polong (i) actively, in a step-by-step execution in the Cognitive Class Virtual Lab (formerly called the Data Scientist Workbench) with a cloud-based R session through their web browsers, or (ii) passively, reading the static printable content.

  • RSS qoto.org/@daviding (Mastodon)

    • daviding: Will this decade be May 27, 2020
      Will this decade be called the "Dark Twenties", in post-pandemic economic sociology? #JohnIbbitson writes: > It took years for Western economies to fully recover from the economic shock of 2008-09. This shock is far worse. How much worse? No one can be sure. [....] > We are entering the Dark Twenties. No one knows when […]
    • daviding: Moderating social me May 27, 2020
      Moderating social media context in an nuanced way may be done with a warning or caution, rather than by deleting the message or banning the individual. #HenryFarrell at #WashingtonPost analyzes fact-checking on POTUS. > Now, Twitter has done just this. Trump’s tweet has not been removed — but it has been placed behind a notice, […]
    • daviding: Our immune systems a May 26, 2020
      Our immune systems are complex, so improving resistance to disease may be puffery, writes #TimothyCaulfield . > I looked at how the phrase “boosting our immune system” is being represented on social media. This concept is everywhere right now: it is being pushed by .... But in reality, the immune system is fantastically complex and can’t be “boosted.” (Even […]
    • daviding: Ventures founded on May 17, 2020
      Ventures founded on growth maximization thinking unicorn might instead turn towards sustainability as camels. > Where Silicon Valley has been chasing unicorns (a colloquial term for startups with billion-dollar valuations), “camel” startups, such as those founded by leading global entrepreneurs, prioritize sustainability and resiliency.> The humble camel adapts to multiple climates, survives without food or […]
    • daviding: Death of the office, May 17, 2020
      Death of the office, in pandemic times, with a larger perspective back in history. > Offices have always been profoundly flawed spaces. Those of the East India Company, among the world’s first, were built more for bombast than bureaucracy. They were sermons in stone, and the solidity of every marble step, the elegance of every […]
  • RSS on IngBrief

    • Wholism, reductionism (Francois, 2004)
      Proponents of #SystemsThinking often espouse holism to counter over-emphasis on reductionism. Reading some definitions from an encyclopedia positions one in the context of the other (François 2004).
    • It matters (word use)
      Saying “it doesn’t matter” or “it matters” is a common expression in everyday English. For scholarly work, I want to “keep using that word“, while ensuring it means what I want it to mean. The Oxford English Dictionary (third edition, March 2001) has three entries for “matter”. The first two entries for a noun. The […]
    • Systemic Change, Systematic Change, Systems Change (Reynolds, 2011)
      It's been challenging to find sources that specifically define two-word phrases -- i.e. "systemic change", "systematic change", "systems change" -- as opposed to loosely inferring reductively from one-word definitions in recombination. MartinReynolds @OpenUniversity clarifies uses of the phrases, with a critical eye into motives for choosing a specific label, as well as associated risks and […]
    • Environmental c.f. ecological (Francois, 2004; Allen, Giampietro Little 2003)
      The term "environmental" can be mixed up with "ecological", when the meanings are different. We can look at the encyclopedia definitions (François 2004), and then compare the two in terms of applied science (i.e. engineering with (#TimothyFHAllen @MarioGiampietro and #AmandaMLittle, 2003).
    • Christopher Alexander’s A Pattern Language: Analysing, Mapping and Classifying the Critical Response | Dawes and Ostwald | 2017
      While many outside of the field of architecture like the #ChristopherAlexander #PatternLanguage approach, it's not so well accepted by his peers. A summary of criticisms by #MichaelJDawes and #MichaelJOstwald @UNSWBuiltEnv is helpful in appreciating when the use of pattern language might be appropriate or not appropriate.
    • Field (system definitions, 2004, plus social)
      Systems thinking should include not only thinking about the system, but also its environment. Using the term "field" as the system of interest plus its influences leaves a lot of the world uncovered. From the multiple definitions in the International Encyclopedia of Systems and Cybernetics , there is variety of ways of understanding "field".
  • Recent Posts

  • Archives

  • RSS on daviding.com

    • 2020/05 Moments May 2020
      Life at home is much the same with the pandemic sheltering-in-place directives, touring city streets on bicycle, avoiding the parks on weekends.
    • 2020/04 Moments April 2020
      Living in social isolation in our house with 5 family members, finishing off teaching courses and taking courses.
    • 2020/03 Moments March 2020
      The month started with a hectic coincidence of events as both a teacher and student at two universities, abruptly shifting to low gear with government directives for social distancing.
    • 2020/02 Moments February 2020
      Winter has discouraged enjoying the outside, so more occasions for friend and family inside.
    • 2020/01 Moments January 2020
      Back to school, teaching and learning at 2 universities.
    • 2019/12 Moments December 2019
      First half of December in finishing up course assignments and preparing for exams; second half on 11-day family vacation in Mexico City.
  • RSS on Media Queue

  • Meta

  • Creative Commons License
    This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
    Theme modified from DevDmBootstrap4 by Danny Machal