SPSS Statistics

SPSS Statistics

Your hub for statistical analysis, data management, and data documentation. Connect, learn, and share with your peers! 

 View Only
  • 1.  Do I need Spss Statistics Server?

    Posted Thu November 23, 2023 10:31 AM

    Issue:

    Our use case is described below. In short, we are looking to automate execution of pre-written script and send the output via mail to the scripter.

    Background

    Our teams use IBM SPSS for most of the projects that we program in-house to perform data checks. Also known as Data Validation (DV), to ensure data is consistent with the logic mentioned in the questionnaire. The DV checks are scripted in SPSS when the survey is launched and could be required multiple times (as more respondents' data is collected) until the survey is closed in the field. These checks are run at least twice (if not more) – after 10%-20% collection (known as Soft-launch /SL checks) and at 100% (final checks).

     The checks are scripted after the survey launch phase and the data checks requires pulling the data from data collection platforms (such as Decipher, ConfirmIT, CMIX) and executing the script against the dataset. The output is, generally, collected in two files –

    1. first one is more like a log that captures if any errors were encountered with the script during execution.
    2. second file captures any inconsistencies in data for any respondents, as the script 

    The subsequent data checks are done using the same script against fresh datasets over time, as more respondents' data is collected, until the survey is closed in the field.

    Goal

    We are aiming to automate the subsequent rounds of data check, once the initial (Soft launch) checks are completed for a survey. We can save on the efforts involved with pulling data files, executing the file manually and fetching the output from the two files. 

    Challenges

    1. We can schedule data exports from data collection platforms, at specific time / regular intervals. However, with SPSS everything is tied to the user (perhaps, due to their licensing). So, even if the task is scheduled at a specific time, it needs the user (client) to be connected to the SPSS server at that time.
    2. The output files are saved in the user's working directory, which requires the user to login and access those files.
    3. This restricts our ability to perform the checks over the weekend or user's off-shift hours, if required.
    4. Additionally, the user is also required to be connected to the VPN, which slows them down when they may have something to work in parallel.
    5. SPSS needs the data files to be hosted locally or on a network, but to us, seems like doesn't support cloud-hosted data files.

    What are we looking for

    1. Based on the use case specified above –
      1. We need to find out from IBM SPSS support, if they already have a solution in place to address this.
      2. If they don't then can they provide us some directions on how we can accomplish this?
    2. Browsing through some SPSS documentation that I was able to access on internet, SPSS supports "Batch mode" processing which could run in "Detached mode", meaning the user can initiate the action but could log off and the execution would still happen. We'd ideally need some more insights on how this works.
    3. We'd need a virtual system that hosts SPSS Server, so we can play around with configuration / settings and not disrupt any LIVE production jobs.

    If there is already have a solution (first option), then we may not need options 2 & 3....SPSS Modeler? not sure options 



    ------------------------------
    Christina Mitchell
    ------------------------------


  • 2.  RE: Do I need Spss Statistics Server?

    Posted Thu November 23, 2023 10:53 AM
    a lot depends on the details, but I would point you to a few tools that could help.  You have already discovered the production mode.  Using OS facilities, you could schedule a batch job to run at scheduled intervals, perhaps on a dedicated PC or with Statistics Server.

    There is an extension command, SPSSINC PROCESS FILES, that could pick up newly arrived files and run through SPSS jobs on them.  This can be installed via the Extension Hub.  You may already have it, but it was updated to simplify usage a while ago, so you should update it if you have the older version.  A usage tutorial is installed along with it.

    We could discuss details offline if that would be helpful.





  • 3.  RE: Do I need Spss Statistics Server?

    Posted Thu November 23, 2023 10:55 AM
    P.s.  SPSS, at least v29, can access files on Microsoft OneDrive.--





  • 4.  RE: Do I need Spss Statistics Server?

    Posted Fri November 24, 2023 10:54 AM

    It appears that my last reply did not post.  I could do a call today (Friday) or next week.  Please use my regular email (jkpeck@gmail.com) to continue the discussion.



    ------------------------------
    Jon Peck
    ------------------------------