KIWI - as a graduate student: 12月 2015

2015年12月31日星期四

心理計量的"錯誤"或"不同概念"之paper searching

Bland altman

1. Yang KG, Raijmakers NJ, Verbout AJ, Dhert WJ, Saris DB. Validation of the short-form womac function scale for the evaluation of osteoarthritis of the knee. J Bone Joint Surg Br. 2007;89:50-56

說明:

Bland altman使用方法不同: 同時用於信度與效度的驗證 (SF WOMAC FS之再測信度、SF WOMAC和 modified WOMAC與tradional版之收斂效度)，且主要非由此方法看一致性程度而是藉由視覺呈現Bland altman plot 來看是否有 heteroscedasticity，進而判斷工具評量之準確度是否會受到症狀嚴重度之影響。

問題之處 : 1. 沒有external criteria

2. 僅看收斂效度證據薄弱 ?

Predictive validity

2. Czaikowski BL, Liang H, Stewart CT. A pediatric four score coma scale: Interrater reliability and predictive validity. J Neurosci Nurs. 2014;46:79-87

說明:

本篇驗證PFSS之預測效度，外在效標為PCPC(看functional outcomes)。預測效度之概念也是看工具預測功能(包括outcomes和 mortality)之能力，但採用之方法卻非使用皮爾森相關而是用ROC曲線，看曲線下面積AUC大小來判斷工具診斷出poor outcome之效能(分成分數1-3及4-6兩類, 4-6為poor outcome)。

問題之處 :研究目的是想看工具預測差的outcomes和院內死亡的效能，因為研究設計是同時驗證(在admission和discharge)，似乎就不太能符合研究目的(預測未來功能表現)。

ecological validity

3. Junghaenel DU, Schneider S, Stone AA, Christodoulou C, Broderick JE. Ecological validity and clinical utility of patient-reported outcomes measurement information system (promis(r)) instruments for detecting premenstrual symptoms of depression, anger, and fatigue. J Psychosom Res. 2014;76:300-306

說明:

本研究驗證PROMIS-CAT用於偵測經前症候群婦女憤怒、憂鬱、疲累症狀之生態效度，生態效度概念一樣是看測驗分數是否能反映出於日常生活之表現，方法為看PROMIS-CAT及corresponding daily measure之相關。

疑問之處：

１.文章驗證生態效度方法有二如下ａ、ｂ，然不知驗證（ａ）之意義何在　

(a) differences in mean symptom levels on CAT and daily　short forms

(b) correlations between CATs and daily scores for　each week.

２. 生態效度計算Baseline　week、Pre-menstrual　week、Menstrual　week、Post-menstrual　week及Pooled　across　weeks，未針對premenstrual，另外在這四週，症狀會有不同的變化，然卻同時用來驗證生態效度，是否會不當。

2015年12月28日星期一

Responsiveness reading 整理

Individual level 指標 :(偵測個別個案前後測是否有差異之能力)

1. 以總分改變量分成三等級，比較各等級人數比例
(a)超過 MDC95：代表功能進步具備統計顯著
(b)0 ~ MDC95：代表未進步或進步量未具備統計顯著
(c)低於 0：退步

2.相對風險(relative risk, RR) : 驗證兩個量表於各等級人數之相對比值以及是
否具備顯著差異[比較兩工具]

3.最小可偵測變化值比值: 總分之改變量除以個別量表之最小可偵測變化值

Group level 指標: (評估工具呈現團體前後測改變量之能力)

1. 效應值 d (effect size d)
2. 標準化反應平均值 (standardized response mean)
3. 配對 t 檢定 : 可以看改變是否有顯著差異or [比較兩工具] 總分於前後二次測量之平均分數是否有顯著差異

P.S. MDC為sample independent，可類推個案

* 以Tests of Marginal Homogeneity(卡方檢定X2 /G2) 檢驗兩工具三等級的資料分佈是否相同

2015年12月13日星期日

區辨效度部分研究

議題

1.慢性病患可行性較高 (急性組取消)

2. CGI-S或PANSS使用

根據資料統計:

心湖&文山坊CGI-S統計結果顯示大部分集中在level 1-2

[pretest]
CGI-S 一分 40人
CGI-S 二分 26人
CGI-S 三分 9人

[postest]
CGI-S 一分 49人
CGI-S 二分 20人
CGI-S 三分 4人

(59/73人前後分數一致)

* 慢性病患CGI-S差異不大，仍可以使用CGI-S，但只能預期差異大概不顯著

* 根據莞薈學姊之說明，約 1月會去松德說明收案可以一起去，但若需使用PANSS則需跟她一起收，學姊約5、6月才開始收案

問題

1. 這樣看起來是採用CGI-S較好嗎 ? 還是要跟學姊收才有辦法呢?

2. 是否加入正常組 ?

2015年12月11日星期五

流程問題確認

1. 急性組收案條件之一: 已開始1~2周之精神科職能治療 ?

考量 :

1. 病人症狀 (evidence : 4~6星期可有效控制，急性發作6個月逐漸穩定)，和慢性組區隔

2. 適合評估---個案治療師評估?

3. 案量 : 未出院

學者葉玲玲、藍忠孚與鄭若瑟（2003）利用國家衛生研究院全民健保資料庫抽樣 10 萬人，分析顯示：精神分裂症患者的急性病房平均住院日數是 37.6 天，各醫院層級中，以醫學中心急性病房平均住院日數最長（55.7 天），其次為區域醫院（41.1 天），再其次為地區醫院（22.3 天）。

2. 施測者間信度: 驗證2個測驗 ? 量表 (ICC) 或項目(KAPPA) 的選擇?

3. 使用MMSE

2015年12月6日星期日

心理計量文獻閱讀問題-單向度

1. Guttman scales相加之overall scores是否有意義

2. Rasch是否不適合驗證單向度?欲轉成interval scale才用rasch?若只為確認單向度驗證Mokken?

Guttman scales是更簡易之指標?

3. 何謂”IRFs of the scale do not intersect?"

4. Mokken指標The scalability coefficient H : a global indicator of the degree to which patients can be accurately ordered on the upper extremity function by means of their sum scores à此sum scores 只能做比較用，而不看能力差異之大小?

5. 何謂Intermodel agreement?

-----------------------------------------------------------------------------------------------------

Points:

Guttman scale

由單向或具有同一性質的項目所組成，項目由弱到強連續性排序

Standard:(to be a valid cumulative and unidimensional Guttaman model)

Coefficient of Reproducibility(CR): how well the observed responses corresponded to the responses that would have been predicted if the items were perfectly ordered. (CR>.90)

Coefficient of Scalability(CS): the proportion of responses which can be correctly predicted from the total score (CS>.60)

Mokken analysis

1. an non-parametric item response theory model

2. examine accuracy of ordering between person’s raw sum scores

3. need sample size>200

4. standard: MH modal:The scalability coefficient H>0.5

DM modal: adequate: largest Crit value per item <40

violated: both Crit values for an item>80

Rasch analysis

1. parametric item response theory model

2. examine whether sum scores of a measure can be transformed into interval scores.

(interval property can quantify change and do arithmetic operations)

3. Limitation: exclude items whose scores cannot be transformed into an interval scale, but which do fit the Mokken model’s expectations

2015年12月2日星期三

sharing from class: CI &Statistical Significance

信賴區間與統計顯著性之關係: 不一定等價

1. 兩平均值CI不重疊，則兩平均值具顯著差異
2. 兩平均值CI重疊，則說兩平均值不具顯著差異"不一定正確"

例如 :兩獨立樣本

first sample mean =9
second sample mean =17
SE相同=2.5

95% CI of first sample =(4.1, 13.9)
95% CI of second sample=(12.1, 21.9)
兩個信賴區間有重疊

t-檢定(comparing two means) : T=(17-9)/√(2.5*2.5+2.5*2.5)=2.26 拒絕虛無假設,兩平均值具顯著差異

在解釋數據時需要小心~~~~

_______________________________________________

tennis class :
開始嘗試雙打比賽、學習計分規則，從比賽中可以知道需要練習的部分-->發球真的很重要，因為發球的區域不大很容易就出界，需要準確控制。

完整的發球動作需要 :重心往前---> 往後 ---> 往前
(右手後劃弧,左手拋球)
*菜刀持拍(same as羽球正拍)
*球放手指與掌面交界處，thumb在上面輕扣
*球拋到右上方，身體需朝球之去向

動作越好，攻擊性越強，但因為重心移動的關係，這種動作比先拉好拍、直接拋打來的難許多!

2015年12月31日 星期四